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The aim of physics is to write down the Hamiltonian of 
the universe. The rest is mathematics. 


Mathematics wants to discover and investigate universal 
structures. Which of them are realized in nature is left to 
physics. 


Preface 


Perhaps, this is a bad book. As a mathematician, you will not fin a systematic 
theory with complete proofs, and, even worse, the standards of rigor established for 
mathematical writing will not always be maintained. As a physicist, you will not 
fin coherent computational schemes for arriving at predictions. 

Perhaps even worse, this book is seriously incomplete. Not only does it fall short 
of a coherent and complete theory of the physical forces, simply because such a the- 
ory does not yet exist, but it also leaves out many aspects of what is already known 
and established. 

This book results from my fascination with the ideas of theoretical high energy 
physics that may offer us a glimpse at the ultimate layer of reality and with the 
mathematical concepts, in particular the geometric ones, underlying these ideas. 

Mathematics has three main subfields analysis, geometry and algebra. Analysis 
is about the continuum and limits, and in its modern form, it is concerned with quan- 
titative estimates establishing the convergence of asymptotic expansions, infinit se- 
ries, approximation schemes and, more abstractly, the existence of objects define in 
infinite-dimensiona spaces, by differential equations, variational principles, or other 
schemes. In fact, one of the fundamental differences between modern physics and 
mathematics is that physicists usually are satisfie with linearizations and formal 
expansions, whereas mathematicians should be concerned with the global, nonlin- 
ear aspects and prove the convergence of those asymptotic expansions. In this book, 
such analytical aspects are usually suppressed. Many results have been established 
through the dedicated effort of generations of mathematicians, in particular by those 
among them calling themselves mathematical physicists. A systematic presentation 
of those results would require a much longer book than the present one. Worse, in 
many cases, computations accepted in the physics literature remain at a formal level 
and have not yet been justifie by such an analytical scheme. A particular issue 
is the relationship between Euclidean and Minkowski signatures. Clearly, relativ- 
ity theory, and more generally, relativistic quantum fiel theory require us to work 
in Lorentzian spaces, that is, ones with an indefinit metric, and the corresponding 
partial differential equations are of hyperbolic type. The mathematical theory, how- 
ever, is easier and much better established for Riemannian manifolds, that is, for 
spaces with positive definit metrics, and for elliptic partial differential equations. 


vii 
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In the physics literature, therefore, one often carries through the computations in 
the latter situation and appeals to a principle of analytic continuation, called Wick 
rotation, that formally extends the formulae to the Lorentzian case. The analytical 
justificatio of this principle is often doubtful, owing, for example, to the profound 
difference between nonlinear elliptic and hyperbolic partial differential equations. 
Again, this issue is not systematically addressed here. 

Algebra is about the formalism of discrete objects satisfying certain axiomatic 
tules, and here there is much less conflic between mathematics and physics. In 
many instances, there is an alternative between an algebraic and a geometric ap- 
proach. The present book is essentially about the latter, geometric, approach. Geom- 
etry is about qualitative, global structures, and it has been a remarkable trend in 
recent decades that some physicists, in particular those considering themselves as 
mathematical physicists (in contrast to the mathematicians using the same name 
who, as mentioned, are more concerned with the analytical aspects), have employed 
global geometric concepts with much success. At the same time, mathematicians 
working in geometry and algebra have realized that some of the physical concepts 
equip them with structures that are at the same time rich and tightly constrained and 
thereby afford powerful tools for probing old and new questions in global geometry. 

The aim of the present book is to present some basic aspects of this powerful in- 
terplay between physics and geometry that should serve for a deeper understanding 
of either of them. We try to introduce the important concepts and ideas, but as men- 
tioned, the present book neither is completely systematic nor analytically rigorous. 
In particular, we describe many mathematical concepts and structures, but for the 
proofs of the fundamental results, we usually refer to other sources. This keeps the 
book reasonably short and perhaps also aids its coherence. — For a much more sys- 
tematic and comprehensive presentation of the fundamental theories of high-energy 
physics in mathematical terms, I wish to refer to the forthcoming 6-volume treatise 
[111] of my colleague Eberhard Zeidler. 

As you will know, the fundamental problem of contemporary theoretical physics! 
is the unificatio of the physical forces in a single, encompassing, coherent “The- 
ory of Everything”. This focus on a single problem makes theoretical physics more 
coherent, and perhaps sometimes also more dynamic, than mathematics that tradi- 
tionally is subdivided into many field with their own themes and problems. In turn, 
however, mathematics seems to be more uniform in terms of methodological stan- 
dards than physics, and so, among its practioners, there seems to be a greater sense 
of community and unity. 

Returning to the physical forces, there are the electromagnetic, weak and strong 
interactions on one hand and gravity on the other. For the firs three, quantum fiel 
theory and its extensions have developed a reasonably convincing, and also rather 
successful unifie framework. The latter, gravity, however, more stubbornly resists 
such attempts at unification Approaches to bridge this gap come from both sides. 
Superstring theory is the champion of the quantum camp, ever since the appearance 


'More precisely, we are concerned here with high-energy theoretical physics. Other f elds, like 
solid-state or statistical physics, have their own important problems. 
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of the monograph [50] of Green, Schwarz and Witten, but many people from the 
gravity camp seem unconvinced? and propose other schemes. Here, in particular 
Ashtekar’s program should be mentioned (see e.g. [92]). The different approaches 
to quantum gravity are described and compared in [74]. A basic source of the dif- 
ficultie that these two camps are having with each other is that quantum theory 
does not have an ontology, at least according to the majority view and in the hands 
of its practioners. It is solely concerned with systematic relations between observa- 
tions, but not with any underlying reality, that is, with laws, but not with structures. 
General relativity, in contrast, is concerned with the structure of space-time. Its 
practioners often consider such ideas as extra dimensions, or worse, tunneling be- 
tween parallel universes, that are readily proposed by string theorists, as too fanciful 
flight of the imagination, as some kind of condensed metaphysics, rather than as 
honest, experimentally verifiabl physics. Mathematicians seem to have fewer diffi 
culties with this, as they are concerned with structures that are typically believed to 
constitute some higher form of ‘Platonic’ reality than our everyday experience. In 
the present book, I approach things from the quantum rather than from the relativity 
side, not because of any commitment at a philosophical level, but rather because 
this at present offers the more exciting mathematical perspectives. However, this is 
not meant to deny that general relativity and its modern extensions also lead to deep 
mathematical structures and challenging mathematical problems. 

While I have been trained as a mathematician and therefore naturally view things 
from a structural, mathematical rather than from a computational, physical perspec- 
tive, nevertheless I often fin the physicists’ approach more insightful and more to 
the point than the mathematicians’ one. Therefore, in this book, the two perspectives 
are relatively freely mixed, even though the mathematical one remains the dominant 
one. Hopefully, this will also serve to make the book accessible to people with either 
background. In particular, also the two topics, geometry and physics, are interwoven 
rather than separated. For instance, as a consequence, general relativity is discussed 
within the geometry part rather than the physics one, because within the structure of 
this book, it fit into the geometry chapter more naturally. 

In any case, in mathematics, there is more of a tradition of explaining theoretical 
concepts, and good examples of mathematical exposition can provide the reader 
with conceptual insights instead of just a heap of formulae. Physicists seem to make 
fewer attempts in this direction. I have tried to follow the mathematical style in this 
regard. 

I have assembled a representative (but perhaps personally biased) bibliography, 
but I have made no attempt at a systematic and comprehensive one. In the age of the 
Arxiv and googlescholar, such a scholarly enterprise seems to have lost its useful- 
ness. In any case, I am more interested in the formal structure of the theory than in 
its historical development. Therefore, the (rather few) historical claims in this book 
should be taken with caution, as I have not checked the history systematically or 
carefully. 


?For an eloquent criticism, see for example Penrose [85]. 
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Chapter 1 
Geometry 


1.1 Riemannian and Lorentzian Manifolds 


1.1.1 Differential Geometry 


We collect here some basic facts and principles of differential geometry as the foun- 
dation for the sequel. For a more penetrating discussion and for the proofs of var- 
ious results, we refer to [65]. Classical differential geometry as expressed through 
the tensor calculus is about coordinate representations of geometric objects and the 
transformations of those representations under coordinate changes. The geometric 
objects are invariantly defined but their coordinate representations are not, and re- 
solving this contradiction is the content of the tensor calculus. 

We consider a d-dimensional differentiable manifold M (assumed to be con- 
nected, oriented, paracompact and Hausdorff) and start with some conventions: 


1. Einstein summation convention 


d 


di= > adi, (1.1.1) 


i=l] 


The content of this convention is that a summation sign is omitted when the same 
index occurs twice in a product, once as an upper and once as a lower index. This 
rule is not affected by the possible presence of other indices; for example, 


d 
ABS) Kb. (1.1.2) 
j=l 


The conventions about when to place an index in an upper or lower position will 
be given subsequently. One aspect of this, however, is: 

2. When G = (g;;)j,; is a metric tensor (a notion to be explained below) with in- 
dices i, j, the inverse metric tensor is written as G7! = (g!4)i,;, that is, by raising 
the indices. In particular 


2 1 wheni=k 
Voun=si i= , 1.1.3 
ewe ‘ 0 wheni +k, ( ) 


the so-called Kronecker symbol. 
3. Combining the previous rules, we obtain more generally 
i 


v=giy; and vj=gijv!. (1.1.4) 
Jj j 
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4. For d-dimensional scalar quantities (¢ Maes ¢%), we can use the Euclidean met- 
ric 6;; to freely raise or lower indices in order to conform to the summation 
convention, that is, 


i = dijo! = ¢". (E13) 


A (finite-dimensional manifold M is locally modeled after R¢. Thus, locally, it 
can be represented by coordinates x = (x!,..., x“) taken from some open subset 
of R?. These coordinates, however, are not canonical, and we may as well choose 
other ones, y = (y!,..., y2), with x = f (y) for some homeomorphism f. When the 
manifold M is differentiable—as always assumed here—we can cover it by local co- 
ordinates in such a manner that all such coordinate transitions are diffeomorphisms 
where defined Again, the choice of coordinates is non-canonical. The basic content 
of classical differential geometry is to investigate how various expressions repre- 
senting objects on M like tangent vectors transform under coordinate changes. Here 
and in the sequel, all objects define on a differentiable manifold will be assumed 
to be differentiable themselves. This is checked in local coordinates, but since coor- 
dinate transitions are diffeomorphic, the differentiability property does not depend 
on the choice of coordinates. 


Remark For our purposes, it is often convenient, and in the literature, it is custom- 
ary, to mean by “differentiability” smoothness of class C™, that is, to assume that all 
objects are infinitel often differentiable. The ring of (infinitel often) differentiable 
functions on M is denoted by C°(M). Nonetheless, at certain places where analy- 
sis is more important, we need to be more specifi about the regularity classes of the 
objects involved. But for the moment, we shall happily assume that our manifold 
is of class C®. 


A tangent vector for M at some point p represented by xo in local coordinates! 
x is an expression of the form 


an) 
Va=u'—. 1.1.6 
Vo (1.1.6) 
This means that it operates on a function ¢(x) in our local coordinates as 
0 
V(o) (x0) = v' id : (1.1.7) 
ox! |x=xo 


The summation convention (1.1.1) applies to (1.1.7). The i in —2, is considered to 


ax! 
be a lower index since it appears in the denominator. 
The tangent vectors at p € M form a vector space, called the tangent space T,M 


of M at p. A basis of T,,M is given by the 4, considered as derivative operators 


'We shall not always be so careful in distinguishing a point p as an invariant geometric object from 
its representation x9 in some local coordinates, but frequently identify p and xo without alerting 
the reader. 
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at the point p represented by xo in the local coordinates, as in (1.1.7).? Whereas, 
as should become clear subsequently, this tangent space and its tangent vectors are 
define independently of the choice of local coordinates, the representation of a tan- 
gent space does depend on those coordinates. The question then is how the same 
tangent vector is represented in different local coordinates y with x = f(y) as be- 
fore. The answer comes from the requirement that the result of the operation of the 
tangent vector V ona function ¢, V(@), be independent of the choice of coordinates. 
Always applying the chain rule, here and in the sequel, this yields 


jayk a 
ax! ayk- 


V=u (1.1.8) 


. . . ; k . . . 
Thus, the coefficient of V in the y-coordinates are v’ oy This is verifie by the 
following computation: 


ay* a dyk ab axis. xd A a) 
Y 4G at 9 ag a 
ax! dy 


u! = =—— = 
ax! axs dayk ox! Ox/ ox! 


(1.1.9) 


as required. 

More abstractly, changing coordinates by f pulls a function ¢ define in the x- 
coordinates back to f*@ define for the y-coordinates, with f*¢(y) = @(f(y)). If 
then W = w* is a tangent vector written in the y-coordinates, we need to push it 
forward as 


feW = wt — — (1.1.10) 


to the x-coordinates, to have the invariance 


(f.W)(b) = W( Ff") (1.1.11) 
which is easily checked: 
_ , ox! ag ee a _ * 
(f,W)b = w ayk axt aye? (FO) = W(f"¢). (1.1.12) 


In particular, there is some duality between functions and tangent vectors here. How- 
ever, the situation is not entirely symmetric. We need to know the tangent vector 
only at the point x9 where we want to apply it, but we need to know the function @ 
in some neighborhood of xo because we take its derivatives. 

A vector fiel is then define as V(x) =v! x)%, that is, by having a tangent 
vector at each point of M. As indicated above, we assume here that the coefficient 
v! (x) are differentiable. The vector space of vector field on M is written as P(TM). 


(In fact, "(7 M) is a module over the ring C™(M).) 


?As here, we shall usually simply write ea in place of so (p) or ea (xo), that is, we assume that 
the point where a derivative operator acts 1s clear from the context or the coefficient 
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Later, we shall need the Lie bracket [V, W] := VW — WYV of two vector field 
Vix) =v (x) 4, W(x) = w! (x) 4 its operation on a function ¢ is 


ax? axT? 
— eee ae ae oe 
Ve WGC) a NS ee) re) Ye 
— (4,,0wlx) avi (x)\ ab (x) 
= (o (x) Au w' (x) ae ) aa (1.1.13) 


In particular, for coordinate vector fields we have 


nr) 


Returning to a single tangent vector, V = v! a at some point x9, we consider a cov- 


ector or cotangent vector w = w;dx' at this point as an object dual to V, with the 
tule 


i( 9 i 
dx ay = 5) (1.1.15) 
yielding 


ae 1 . 
ajdx' Ge =v! 5. =ajv'. (1.1.16) 
Ox/ J 


This expression depends only on the coefficient v! and @; at the point under con- 
sideration and does not require any values in a neighborhood. We can write this as 
w(V), the application of the covector w to the vector V, or as V(w), the application 
of V to w. 

The cotangent vectors at p likewise constitute a vector space, the cotangent 
space te M. 

We have the transformation behavior 


= ax! 
Ya 


dx! dy* (1.1.17) 


required for the invariance of w(V). Thus, the coefficient of w in the y-coordinates 
are given by the identity 


* dy”, (1.1.18) 
Yo 


a 
ajdx' =a; 
a) 
Again, a covector w;dx' is pulled back under a map /: 
; ax! 
f*(@dx!) = 0; ay". (1.1.19) 
dy 


The transformation rules (1.1.10), (1.1.19) apply to arbitrary maps f : M— N from 
M into a possibly different manifold N, not only to coordinate changes or diffeo- 
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morphisms. So, we can always pull back a function or a covector and always push 
forward a vector under a map, but not always the other way around. 

The transformation behavior of a tangent vector as in (1.1.8) is called contravari- 
ant, the opposite one of a covector as (1.1.18) covariant. 

A 1|-form then assigns a covector to every point in M, and thus, it is locally given 
as w;(x)dx!. 

Having derived the transformation of vectors and covectors, we can then also de- 
termine the transformation rules for other tensors. A lower index always indicates 
covariant, an upper one contravariant transformation. For example, the metric 
tensor, written as gijax' ®@ dx/,> with gij = (2 ; =) being the inner product of 
those two basis vectors, operates on pairs of tangent vectors. It therefore transforms 
doubly covariantly, that is, becomes 


Laer F @ dy®. (1.1.20) 


si FONG 


The purpose of the metric tensor is to provide a Euclidean product of tangent vec- 
tors, 


(V, W) = giv w! (1.1.21) 


for V =v! oo, W=w! 4 . As acheck, in this formula, v! and w! transform con- 
travariantly, while g;; transforms doubly covariantly, so that the product as a scalar 
quantity remains invariant under coordinate transformations. 

Similarly, we obtain the product of two covectors w,a € TM as 


(w, «) = g' ajar. (1.1.22) 
We next introduce the concept of exterior p-forms and put 


AP := AP(TZM):=TIM A---ATIM _ (exterior product). (1.1.23) 
ae 
p times 


On A? (TM), we have the exterior product with n «€ T7M = iy (T7M): 


A?(T*M) —> A?*1(T*M) 
(1.1.24) 
Or> €(Nhoa:=nNnAw. 


An exterior p-form is a sum of terms of the form 


w(x) = n(x)dx!! A+++ A dx! 


3Subsequently, we shall mostly leave out the symbol ®, that is, write simply g; jax'dx/ in place 
of gijdx! @ dx/. 
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where n(x) is a smooth function and (x!,...,x%) are local coordinates. That is, 
a p-form assigns an element of A?(T;*M) to every x € M. The space of exterior 
p-forms is denoted by Q?(M). 

When M carries a Riemannian metric gijdx! ® dx/, the scalar product on the 
cotangent spaces TM induces one on the spaces A? (TM) by 


(dx!) A.» Adx!?, dx}! A--- A dx?) := det((dx'", dx/’)) (1.1.25) 


and linear extension. 
Given a Riemannian metric g;;dx' @ dx/, also, in local coordinates, we can 
defin the volume form 


dvolg := ,/det(gij)dx'! A---Adx4, (1.1.26) 


This volume form depends on an ordering of the indices 1, 2,..., d of the local co- 
ordinates: since the exterior product is antisymmetric, dx! A dx/i = —dx/ A dx', 
it changes its sign under an odd permutation of the indices. Thus, when we have 
a coordinate transformation x = f(y) where the Jacobian determinant det( $7) is 
negative, dvol changes its sign; otherwise, it is invariant. Therefore, in order to have 
a globally define volume form on the Riemannian manifold M, we need to exclude 
coordinate changes with negative Jacobian. The manifold M is called oriented when 
it can be covered by coordinates such that all coordinate changes have a positive Ja- 
cobian. In that case, the volume form is well defined and we can defin the integral 
of a function ¢ on M by 


/ $(x) dvol(x). (1.1.27) 


We shall therefore assume the manifold M to be oriented whenever we carry out 
such an integral. We can then also defin the L?-product of p-forms w, a € Q?(M): 


(@, a) := row. a(x))dvolg(x). (1.1.28) 
We now assume that the dimension d = 4, the case of particular importance for the 


application of our geometric concepts to physics. Then when w is a 2-form, w A @ 
is a 4-form. We call w self-dual or antiself-dual when the + resp. — sign holds in 


®ONO=+(a,w)dvol,. (1.1.29) 
When w+ is self-dual, and w_ antiself-dual, we have 
(w4,@-) =0 (1.1.30) 


that is, the spaces of self-dual and antiself-dual forms are orthogonal to each other. 
Every 2-form w on a 4-manifold can be decomposed as the sum of a self-dual and 
an antiself-dual form, 


WO=01,+0_. (1.1.31) 
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We return to arbitrary dimension d. 


Definitio 1.1 The exterior derivative d : Q?(M) — Q?+!(M) (p=0,..., dim M) 
is define through the formula 


3 
d(n(x)dx!! A... Adxir) = wr ax i ndxtt rn... ndxir (1.1.32) 


and extended by linearity to all of 2?(M). 


The exterior derivative enjoys the following product rule: If w € Q?(M), v0 € 
Q7(M), then 
d(@A0)=dort+(-1)?aadb, (1.1.33) 


from the formula w A 3 = (—1)?40 Aw and (1.1.32). 
Let x = f(y) be a coordinate transformation, 


w(x) =n(x)dx!! A+. A dx!? € QP(M),. 


In the y-coordinates, we then have 


i 
dy? (1.1.34) 


M1 AL A * 
oy 


which is the transformation formula for p-forms. The exterior derivative is compat- 
ible with this transformation rule: 


d(f*(w)) = f* (do), (1.1.35) 
which follows from the transformation invariance 


an) 47 SUF O) a a 9 FO) ja 


axd axd aya? dy” eee) 


Thus, d is independent of the choice of coordinates. d satisfie the following impor- 
tant rule: 


Lemma 1.1 


dod=0. (1.1.37) 


Proof We check (1.1.37) for forms of the type 


w(x) = f(x)dx!! A... A dx!? 
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from which it extends by linearity to all p-forms. Now 


dod(a(x)) = a( ax! Ade! Aw K as) 
x 


OF ak ari adx! 
= : pax A dx! Adx' A-+»Adx'? =0, 
Ox) Ox 
Of SEE. j _ yk j 
since —74—¢ = 4,kj,7 and dx- A dxk dx" Adx!, 


In the preceding, we have presented one possible way of conceptualizing trans- 
formations, the one employed by mathematicians: The same point p is written in dif- 
ferent coordinate systems x and y, which are then functionally related by x = x(y). 
Another view of transformations, often taken in the physics literature, is to move the 
point p and consider the induced effect on tensors. Let us discuss the example of a 
1-form w(x)dx. Within the fi ed coordinates x, we vary the points represented by 
these coordinates by 


xhex+e&(x) =x + €5x (1.1.38) 


for some map & and some small parameter €, and we want to take the limit « > 0. 
We have the induced variation of our 1-form 


w(x)dx h w(x + €&(x))d(x + €&(x)) =: w(x) + €6@(x). (1.1.39) 


By Taylor expansion, we have 


: 0g! 
or + €&(a))d(x + €€(x)) = (« ‘w) («x te ae as‘) 
x 
+ higher order terms (1.1.40) 
from which we conclude that for « > 0 
OO; ky j 0é! k 
6a = gee dx! + wi ax : (1.1.41) 


Of course, since oe =F £ axk = dé', the last term in (1.1.41) agrees with the one required 
by (1.1.18). 

To put the preceding into a slogan: For setting up transformation rules in geom- 
etry, mathematicians keep the point fixe and change the coordinates, while physi- 
cists keep the same coordinates, but move the point around. The firs approach is 
well suited to identifying invariants, like the curvature tensor. The second one is 
convenient for computing variations, as in our discussion of actions below. 

So far, we have romputed derivatives of functions. We have also talked about 
vector field V(x) =v a) Tas objects that depend differentiably on their ar- 
guments x. Of course, we can do the same for other tensors, like the metric 
gij(x)dx' @ dx/. This naturally raises the question about how to compute their 
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derivatives. This encounters the problem, however, that in contrast to functions, the 
representation of such tensors depends on the choice of local coordinates, and we 
have described in some detail that and how they transform under coordinate changes. 
Precisely because of that transformation, they acquire a coordinate invariant mean- 
ing; for example, the operation of a vector on a function or the metric product be- 
tween two vectors are both independent of the choice of coordinates. 

It now turns out that on a differentiable manifold, there is in general no single 
canonical way of taking derivatives of vector field or other tensors in an invariant 
manner. There are, in fact, many such possibilities, and they are called connections 
or covariant derivatives. Only when we have additional structures, like a Riemannian 
metric, can we single out a particular covariant derivative on the basis of its com- 
patibility with the metric. For our purposes, however, we also need other covariant 
derivatives, and therefore, we now develop that notion. We shall treat this issue 
from a more abstract perspective in Sect. 1.2 below, and so the reader who wants to 
progress more rapidly can skip the discussion here. 

Let M be a differentiable manifold. We recall that [(T M) denotes the space of 
vector field on M. An (affine connection or covariant derivative on M is a linear 
map 


V:I(TM) @eI(TM) > (TM), 
(V,W)H VyW 


satisfying: 


(i) V is tensorial in the firs argument: 


Vvitv,W=Vy,W+Vy,W forall Vi,V¥o,W el(TM), 
VrvW=fVvW forall f €C°(M),V,WeT(TM); 


(ii) V is R-linear in the second argument: 
Vv (W, + W2) =VvW,+VyW2 forall V, Wi, Wo Ee T(TM) 
and it satisfie the product rule 


Vv(fW)=V(f)W+fVvyW forall fe C~(M),V,W eT(TM). 
(1.1.42) 


Vv W is called the covariant derivative of W in the direction V. By (i), for any 
xo € M, (Vy W)(xo) only depends on the value of V at x9. By way of contrast, it also 
depends on the values of W in some neighborhood of x9, as it naturally should as 
a notion of a derivative of W. The example on which this is modeled is the Euclidean 
connection given by the standard derivatives, that is, for V = V! 3, W=wi 4, 


OW! a 
eucl W= i 
Vv = ax! OxJ° 
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However, this is not invariant under nonlinear coordinate changes, and since a gen- 
eral manifold cannot be covered by coordinates with only linear coordinate trans- 
formations, we need the above more general and abstract concept of a covariant 
derivative. 

Let U be a coordinate chart in M, with local coordinates x and coordinate vec- 


tor field ae ig wa (d = dim M). We then defin the Christoffel symbols of the 
connection V via 
Vip eee 1.1.43 
i Oxd 1 9yR ae, 
Thus, 
aw/ t 9 
VyvW=V! Sh ae GWEN oe pe (1.1.44) 


In order to understand the nature of the objects involved, we can also leave out 
the vector fiel V and consider the covariant derivative VW as a 1-form. In local 
coordinates 


a 
VWw= Wi ax", (1.1.45) 
with 
; aw! 
Wi= a Wir’. (1.1.46) 


If we change our coordinates x to coordinates y, then the new Christoffel symbols, 


, 1.1.47 
ay! ay” Im dy” ( ) 
are related to the old ones via 
~ ax! dxJ a2x* ) ay” 
_ Jpk 
Vin VQ) = {rhe dyt ay T Bylaym aa (1.1.48) 


In particular, due to the term the Christoffel symbols do not transform as 


a2xk 
aylaym? 
a tensor. However, if we have two connections ! V, *V, with corresponding Christof- 
fel symbols Des ee then the difference re = 'F ‘ does transform as a tensor. 
Expressed more abstractly, this means that the space of connections on M is an 
affin space. 


For a connection V, we defin its torsion tensor via 
T(V,W):=VyW-VwV-[V,W] forV,WeT(TM). (1.1.49) 


Inserting our coordinate vector field Pa as before, we obtain 


a a a a 
Tj :=T(—,.— )=Vea V2 
ax! ax/ ax? OX! at Ox! 
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(since coordinate vector field commute, 1.e., [ Ge asOe ee 0) 


ax!? axJ 
0 
k k 
Oa age 


We call the connection V torsion-free or symmetric if T = 0. By the preceding 
computation, this is equivalent to the symmetry 


ri,=%, for alli, j,k. (1.1.50) 


Let c(t) be a smooth curve in M, and let V(t) := é(t) (=e! (1); (c()) in local 
coordinates) be the tangent vector fiel ofc. In fact, we should instead write V (c(t)) 
in place of V(t), but we consider t as the coordinate along the curve c(t). Thus, in 
those coordinates 2 — a o. and in the sequel, we shall frequently and implicitly 
make this identification that is, switch between the points c(t) on the curve and 
the corresponding parameter values t. Let W(t) be another vector fiel along c, i.e., 


W(t) € Tea) M for all t. We may then write W(t) = w(t) (c(t) and form 


ax! 
Ve Waa +e OVs ee 
et) axi at oxi 
=) + 8 OMOFECO)= 
FE gh EE SO age 


(the preceding computation is meaningful as we see that it depends only on the 
values of W along the curve c(t), but not on other values in a neighborhood of 
a point on that curve). 

This represents a (nondegenerate) linear system of d first-orde differential oper- 
ators for the d coefficient ju! (t) of W(t). Therefore, for given initial values u' (0), 
there exists a unique solution W(t) of 


Ver) W(t) = 0. 


This W(t) is called the parallel transport of W(0) along the curve c(t). We also say 
that W(t) is covariantly constant along the curve c. 

Now, let W be a vector fiel in a neighborhood U of some point x9 € M. W is 
called parallel if for any curve c(t) in U, W(t) := W(c(t)) is parallel along c. This 
means that for all tangent vectors V in U, 


VvW=0, 


a . 
ai Wi + WIT;; =0 identically in U, for alli, k, 
xX 


. 0 
with W = W' aye in local coordinates. 
x 
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This now is a system of d? first-orde differential equations for the d coefficient 
of W, and so, it is overdetermined. Therefore, in general, such W do not exist. Of 


course, they do exist for the Euclidean connection, because in Euclidean coordi- 
a 


nates, the coordinate vector field 5yr are parallel. 


We defin the curvature tensor R by 


R(V, W)Z := Vv VwZ — VwVvZ — Viv,wiZ, (1.1.51) 
or in local coordinates 
0 0 r) 0 
k —_ we _ 
Rij as a(S axl ) axl (i, j,J=1,...,d). (1.1.52) 


The curvature tensor can be expressed in terms of the Christoffel symbols and their 
derivatives via 
Og a 


k k 
al carrie nae eee (1.1.53) 


Ri, = 
We also note that, as the name indicates, the curvature tensor R is, like the torsion 
tensor T, but in contrast to the connection V represented by the Christoffel sym- 
bols, a tensor. This means that when one of its arguments is multiplied by a smooth 
function, we may simply pull out that function without having to take a derivative of 
it. Equivalently, it transforms as a tensor under coordinate changes; here, the upper 
index & stands for an argument that transforms as a vector, that is contravariantly, 
whereas the lower indices /,i, j express a covariant transformation behavior. The 
curvature tensor will be discussed in more detail in Sect. 1.1.5. 

A curve c(t) in M is called autoparallel or geodesic if 


Vee = 0. (1.1.54) 


Geodesics will be discussed in detail and from a different perspective in Sect. 1.1.4. 
Here, we only display their equation and defin the exponential map. In local coor- 
dinates, (1.1.54) becomes 


cr) + rice Me ()=0 fork=1,...,d. (1.1.55) 


This constitutes a system of second-order ODEs, and given x9 € M, V € Ty, M, 
there exist a maximal interval Jy C R containing an open neighborhood of 0 and 
a geodesic 


cy:lyvoM 


with cy (0) = xo, Cv (0) = V. We can then defin the exponential map exp,, on some 
star-shaped neighborhood of 0 € Ty, M: 
CXPy, {VeT,M:lely}>—M, 
(1.1.56) 
Ve cy(l). 


We then have exp,,(¢V) =cv(‘) forO <1 <1. 
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A submanifold S of M is called autoparallel or totally geodesic if for all x9 € S, 
V €T,,S for which exp,, V is defined we have 


exp,,V ES. 


The infinitesima condition needed for this property is that 
VvW(x) €T,S 


for any vector fiel W(x) tangent to S and V € 7,58. 
Now, let M carry a Riemannian metric g = (-,-). 
We say that V is a Riemannian connection if it satisfie the metric product rule 


Z(V, W) =(VzV, W) +(V, VzW). (1.1.57) 


For any Riemannian metric g, there exists a unique torsion-free Riemannian con- 
nection, the so-called Levi-Civita connection V%. It is given by 


(Vi, W, Z) = s(viw, Z) —Z(V,W)+ W(Z,V) 


—(V,[W, Z]) + (Z,[V, W]) + (W,[Z, V])}. (1.1.58) 


The Christoffel symbols of V® can be expressed through the metric; in local coor- 
dinates, with gj; = ( d 9) we use the abbreviation 


13 28 
8ij,k = xk Si (1.1.59) 
and have 
1 
i= 58) (Bit, j + 8j1,i — 8ij,1); (1.1.60) 
or, equivalently, 
Bijk = By iy + gil yy = Ting + Dyes. (1.1.61) 


The Levi-Civita connection V® respects the metric in the sense that if V(t), W(t) 
are parallel vector field along a curve c(t), then 


(V(t), W(t)) = const, (1.1.62) 


that is, products between tangent vectors remain invariant under parallel transport. 


1.1.2 Complex Manifolds 


We start with complex dimension 1. The Euclidean space R? can be made into the 
complex vector space C! on which multiplication by complex numbers of the form 
a+ ib is defined with i = ./—1. Conventions: 


g=xtiy=x!+ix*, Z=x-iy. (1.1.63) 
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In the physics literature, z and z are formally viewed as independent coordinates. 
We defin 


a. ol . 0. ol . 
0. 3 ge ey) Be ae = 5 en tH 10y). (1.1.64) 
This is arranged so that 
d.z = 1, 0,z =0, (1.1.65) 


and so on. A function f : C > C is called holomorphic if 
dz f =0. (1.1.66) 


Mathematicians write f(z) for any function of the complex variable z. Physicists 
instead write f(z, Z), reserving the notation f(z) for a holomorphic function, that 
is, one satisfying (1.1.66) because that relation formally expresses independence of 
the coordinate z. Similarly, g : C > C is antiholomorphic if 


ag =0. (1.1.67) 


Another reason for the physics convention is to consider the complexificatio C7 
with coordinates (z, z’) of the Euclidean plane C = R?. The slice define by Z =z’ 
then yields the Euclidean plane, while (z, z’) =i(s + t, s — t) gives the Minkowski 
plane with metric dt? — ds. 

When we use the conformal transformation z = e”, with w= t+ i0, —co < 
Tt <oo and 0 <o < 2m, and pass from w = t + io to the light cone coordinates 
¢t=tt+o,¢7 =t-—<a (a so-called Wick rotation), we obtain the Minkowski 
metric in the form d¢tdc~. 

In complex coordinates, the Laplace operator (see (1.1.103), (1.1.105) below) 
becomes 


2 2 2 
A= + = a4. (1.1.68) 
We next have the 1-forms 
dz=dx+idy, dz =dx —idy. (1.1.69) 
This is arranged so that 
dz(dz) =1, dz(dz) = 0, (1.1.70) 
and so on, the analogs of (1.1.15). For a vector v! Z + i ys we write 
v:=v! +iv’, visu! —iv’, (1.1.71) 
and (in fla space) 
U, I= 5! — in), Us =50' +i), (1.1.72) 
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In this notation, the Euclidean (flat metric on R?, g11 = g20 = 1, g12 = 0, becomes 
& &§ &§ 


1 : = 
8§2z = 8% = 2? Sec = 827 = 0, a “= 2, . 
(1.1.73) 
This is set up to be compatible with (1.1.4). Thus, (1.1.72) becomes a special case 


of 
Vz = Bev + Bezv*. (1.1.74) 


The area form for this metric is 
sz Adz = dx Ady. (1.1.75) 


The conventions become clearer when we observe 


yj 811822 — 8], 4x Ady = 4/ 8287 — 83,dz Adz. (1.1.76) 


Also, for a twice covariant tensor, 


1 1 
Viz = aM + 2iVi2 — V2), Vz = a — 2iVi2 — V22), 


; (1.1.77) 
V2 = Vz, = qi + Vo) 
of which (1.1.73) is a special case. 
The divergence is (in fla space) 
A,1u! + 0,2u7 = O,v? + 8:0". (1.1.78) 


The divergence theorem (integration by parts, a special case of Stokes’ theorem) is 
here 


/ (8,02 + d:02)Ldz A dz= =f (v2dz — vdz) (1.1.79) 
Q 2 2 Jag 


with a counterclockwise contour integral around Q. 

We now tum to the higher-dimensional situation. The model space is now C4, the 
d-dimensional complex vector space. The preceding expressions define for d = 1 
then get equipped with coordinate indices: 


z=(z!,...,2%), with z/ =x/ +iy/ (1.1.80) 
using (2), 7) 5.35%, y?) as Euclidean coordinates on R2“, and 
zi = xt —iy/, 


Likewise 


0 1 0 0 
0; = — = l ; 1.1.81 
arr a) (sn +g) i 
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and so on. Then, a function f : C4 + Cis holomorphic if 

of =0 (1.1.82) 
fork=1,...,d. 


Definitio 1.2 A complex manifold of complex dimension d (dimc M = d) isa dif- 
ferentiable manifold of (real) dimension 2d (dimp M = 2d) whose charts take val- 
ues in open subsets of C4 with holomorphic coordinate transitions. 


A one-dimensional complex manifold is also called a Riemann surface, but that 
subject will be taken up in more depth in Sect. 1.4.2 below. 

Let M again be a complex manifold of complex dimension d. Let TRM :=T,M 
be the ordinary (real) tangent space of M at z. We defin the complexifie tangent 
space 


TOM :=TM @RC (1.1.83) 
which we then decompose as 
Cc a ) / " 
T.-M =C) —,—7; =T,MOT,M, (1.1.84) 
az azi . : 


where T/M = c{4) is the holomorphic and T./’/M = C{5} the antiholomor- 


phic tangent space. In T©M, we have a conjugation mapping 3 to rae and so 
Z z ; 


T;'M = T!M. The same construction is possible for the cotangent space, and we 
have analogously 


T2°M =C{dz) ,dz/} =: TY MOT? M. (1.1.85) 


The important point is that these decompositions are invariant under coordinate 
changes because those coordinate changes are required to be holomorphic. In par- 
ticular, we have the transformation rules 


/ ded 7 ack , oc 
dia ah dg 2 (55 )awi = dyit (1.1.86) 
dw w w 


when z = z(w). 
The complexifie space Q*(M; C) of k-forms can be decomposed into subspaces 
Q?:4(M) with p+q =k. Q?:4(M) is locally spanned by forms of the type 


w(z) =n(z)dzit A-- Adziv Adzit n+». Adz, (1.1.87) 


Thus 


Q*(M) = QB QP-4(M). (1.1.88) 
pt+q=k 
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We can then let the differential operators 
1/ a a ; 
ef Pas eid 
d= (5 in; eax +idy’) and 
ia ‘ (1.1.89) 
y ; Psa 
b= (Sting )@ idy’) 
operate on such a form by 
0 . . . 7 T 
Jo= rae! Adz! A-+»Adz'? Adz! A+++ A dz!4 (1.1.90) 
z 
and 
= 0 7 . . 7 rs 
jos it Adz n--- A dzie Adzit n-- Adz’, (1.1.91) 
zd 
4 and @ yield a decomposition of the exterior derivative d: 
Lemma 1.2 
d=a+4 (1.1.92) 
Moreover, 
ad =0, a0 =0, (1.1.93) 
ad = —dd (1.1.94) 
Proof 
- Ifa a lf a a ; 
aE ie eS J 4 idyi ; J _ jdyi 
d+0= (5 in; eax +idy/)+ (5 +55 Jas idy/) 
0 . a . 
= — dx! + —dy! =d. 
ox or dys y 
Consequently, 
0=d*=(8+4)(0+8)=0? +904 90407 
and decomposing this into types yields (1.1.93), (1.1.94). 
1.1.3 Riemannian and Lorentzian Metrics 
In local coordinates x = (x!,..., x7), a metric is represented by a nondegenerate, 
symmetric matrix 
(gij(®))i, jal,....d (1.1.95) 
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smoothly depending on x. Being symmetric, this matrix has d real eigenvalues, and 
being nondegenerate, none of them is 0. When they are all positive, the metric is 
called Riemannian. When only one is positive, and therefore d — 1 ones are nega- 
tive, it is called Lorentzian.* The prototype of a Riemannian manifold is Euclidean 
space, R@ equipped with its Euclidean metric; the model for a Lorentz manifold is 
Minkowski space, namely R¢ equipped with the inner product 


0.0 11 d—1d-1 
(X,y) =x y xX y mex" vy 
for x = (x9, x!,...,x4¢74), y= (y°, yh er yt}, (It is customary to use the in- 
dices 0,...,d — 1 in place of 1,...,d in the Lorentzian case, in order to better 


distinguish the time direction corresponding to 0 from the spatial ones.) This space 
is often denoted by R!-4—!. 
The product of two tangent vectors v, w € T,M with coordinate representations 


(v!,..., v“) and (w!,..., w%) (ie. v= vi, w= wi 2) is then, as in (1.1.21), 
(v, w) == gij(x(p))v'w!. (1.1.96) 


In particular, (4, 4) = g;j. In a Lorentzian manifold, a vector v with (v, v) > 0 


is called time-like, one with (v, v) < 0 space-like, and a nontrivial one with ||v|| = 0 
light-like. 

A (smooth) curve y : [a,b] — M ([a, b] a closed interval in R) is called time- 
like when (y(t), y(t)) > 0 for all t € [a, b]. Light- or space-like curves are define 
analogously. 

Similarly, the length or norm of v is given by 


Nie 


lvl] = (v, v) (1.1.97) 


if (v, w) > 0, and 
1 
Il vl] == —(—(v, v))2 (1.1.98) 
if (v, w) <0. On a Riemannian manifold, of course all vectors v 4 0 have positive 
length. 
Starting from the product (1.1.96), a metric then also induces products on other 
tensors. For example, for cotangent vectors w = wjdx',A = Ajdx' € I M, we have 


(w, A) = g'I (x(p))@iAj;, (1.1.99) 


4The conventions are not generally agreed upon in the literature (see [81] for a systematic survey 
of the older literature). The one employed here seems to be the one followed by the majority 
of physicists. Sometimes, however, for a Lorentzian metric, one requires d — | positive and 1 
negative eigenvalues. Of course, this simply changes the convention adopted here by a minus sign, 
without affecting the geometric or physical content. The latter convention looks natural when one 
wants to add a temporal dimension to already present spatial ones. The convention adopted here, 
in contrast, is natural when one starts with kinetics described by ordinary differential equations 
derived from a positive definit Lagrangian. Thus, the temporal dimension is the primary one and 
counted positively, whereas the additional spatial ones then lead to fiel theories. 
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that is, the induced product on the cotangent space is given by the inverse of the 
metric tensor. As a check, the reader should verify that this expression is invariant 
under coordinate transformations, with the transformation behavior of the metric 
now presented (or recalled from (1.1.20)). 

Let y= f(x). v and w then have representations (o!,..., 5%) and (w,..., w%) 


with o/ = v oat The metric in the new coordinates, denoted by 


hxe(y), then satisfie 


hye(f (x)) 0° w* = (v, w) = gij(x)u'w!. (1.1.100) 
Therefore, the transformation rule is the one given in (1.1.20), 
aft af! 
hil FO) 2 qf = SiO). (1.1.101) 
Given a metric (g;;(*))j, j=1,....d> We put 
J/g := ,/ det(gij) (1.1.102) 


and defin the Laplace—Beltrami operator (Laplacian for short) acting on C°(M) 
as 


A:= Ag := 


ain (vas iJ =) (1.1.103) 


We assume that our manifold M is compact (and, as always, without boundary). We 
then have the integration by parts formula, using (.,.) for the product on 1-forms 
induced by the Riemannian metric g, 


af 8 
[rapasyyeds'---axt = f gE O% ygax!.-ast 


Ox! OxJ 


=~ f fasvgdx'.--ds4 (1.1.104) 


where Jgax! ...dx@ is the volume form dvol g for the Riemannian metric as de- 
fine in (1.1.26). (Note that we are always assuming that our manifold M is oriented. 
This avoids sign ambiguities in the volume form and permits global integration as 
in (1.1.104).) 

In the Euclidean case, the Laplacian is simply the sum of the pure second deriv- 
atives, 


d 
s=D Ge GY (1.1.105) 


(cf. also (1.1.68) above). For the Minkowski metric, we have 


a, . 2 oe 


= d(x°)2 = a(x!)2’ 


(1.1.106) 


and this operator is often denoted by CU in the literature. 
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Generalizing (1.1.98), the metric g induces a product (w,v) on p-forms, see 
(1.1.25), and we can then defin the formal adjoint d* of the exterior derivative d 
via 


[ian »yavol, = [ (u.a"v) avo (1.1.107) 
for a (p — 1)-form yw and a p-form v. (Since d : Q?(M) > Q?+!(M), i.e., d maps 
p-forms to (p + 1)-forms, d* : Q?+!(M) > Q?(M) maps (p + 1)-forms to p- 
forms.) On functions, we then have 

Af =—d*df. (1.1.108) 
More generally, one define the Hodge Laplacian on p-forms by 


dd* +d*d. (1.1.109) 


Since d* f = 0 for functions, i.e, 0-forms f (for the simple reason that there do 
not exist forms of degree —1), this is a generalization of (1.1.108)—up to the sign, 
and these differing sign conventions unfortunately cause a lot of confusion. We then 
have the general integration by parts formulae for p-forms 


[iaran,») avel, = | ldu.dv) dvolg = f (u.d"dv) dvoly (1.1.110) 
and 
[ae +d*d)p, v) dvole = [du.av + (d*, d*v)) dvoly 
= fu. (dd* + d*d)v) dvolg. (1.1.111) 


Let us briefl explain the relation with the cohomology of the (compact, oriented) 
manifold M.A p-form a is called closed if 
dw=0, (1.1.112) 
and it is called exact if there exists some (p — 1)-form 7 with 


wo =dn. (1.1.113) 


Because of d od = 0, see (1.1.37), any exact form is closed. Two closed p-forms 
w!, w* are considered as cohomologically equivalent if their difference is exact, i.e., 


if there exists some (p — 1)-form 7 with 
wo! — w* =dn. (1.1.114) 


The equivalence classes of p-forms constitute a group, the pth (de Rham) cohomol- 
ogy group H?(M) of M. When M carries a Riemannian metric g, one can identify 
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a natural representative for each cohomology class as the unique form jz that mini- 
mizes 


i (w, w)dvol, (1.1.115) 
M 
in its equivalence class. This minimizing form yu is then harmonic in the sense that 
(dd* + d*d)u=0, (1.1.116) 
or equivalently (as follows from (1.1.111) and the nonnegativity of the two terms in 
the middle integral) 
du=0 and d*w=0. (1.1.117) 


Thus, a harmonic form is closed (dj = 0) and coclosed (d* 4 = 0). 

Since M is compact, the dimension b,(M) (called the pth Betti number of M) of 
H?(M) is finite This follows for instance from the fact that the elements of H? (M) 
are identifie with the solutions of the elliptic differential equation (1.1.116). It is 
a general result in the theory of elliptic partial differential equations that their solu- 
tion spaces satisfy a compactness principle. 


1.1.4 Geodesics 


The length of a smooth (or at least rectifiable curve y : [a,b] > M is 
b 
dy 
L = —(t 
=| | amo) 


where we abbreviate <! (ft) := £ (x!(y(t))). Thus, time-, light-, or space-like curves 
have positive, vanishing, or negative length, respectively 
The action of a time-like curve y is 


1 b 
s):=5 , 


Here, y is considered as the orbit of a mass point, which explains the name “ac- 
tion’. In the mathematical literature, the action is often called energy, an unfortunate 
choice of terminology. 

A massive particle in a Lorentzian manifold travels along a world line x(t) with 
arclength 


b 
a=, JejevonvOvod (1.118) 


dy 
a 


2 b 
a=>f si(x(v(t)))x' (Nx (dt. (1.1.119) 


T] 1 
‘= i (gap (x(t))%% (c)£ (r)) dr, 


0 


where we assume gapx%x? > 0 along the world line. Thus, the movement is time- 
like. When in place of gagx%x* > 0, we have 


Saupx” iB = 0, 
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then the particle is massless, that is, a photon. Bupha < 0 would correspond to 


a movement with speed higher than that of light and is excluded. 
By Hélder’s inequality, for a time-like curve y, 


b i b 
; ar<b—a3( f 


with equality precisely if || oy || = const. This means that 


dy 


dt 


2 \3 
ar) (1.1.120) 


>| 


L(y)? <2(b-a)S(y), (1.1.121) 


again with equality only if y has constant norm. 
The distance between p,q € M is 


d(p,q):=inf{L(y): y : [a,b] > M with y(a) = p, y(b) = q}. (1.1.122) 


By the change of variables formula, if y : [a,b] > M isacurve, ando : [a’, b’] > 
[a, b] is a change of parameter, then 


L(yoo)=L(y). (1.1.123) 


This is no longer so for the action, as follows with a little reflectio on the equality 
discussion in (1.1.121). It is instructive to look at the stationary points of the action: 


Lemma 1.3 The Euler-Lagrange equations (see Sect. 2.3.1 below) for the action S 
are 


Pur Tig (x(t) x (NiO) =0, i=1,...,d, (1.1.124) 


where ri, are the Christoffel symbols (1.1.60). 


Proof As will be derived in Sect. 2.3.1 below, the Euler-Lagrange equations of 
a functional 


b 
10) = f ft, x(t), x(t))dt 


are given by 


d 
i ge a ee 
dt ox' = ox! 


Thus, for our action S, 


d . : 
7 sie (1) 4" (1) + gf XO)XI(O) — gj, aM)! OF (DN =0 


fori=1,...,d, 
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hence 
Bink” + a yikl + gin ex x* + gj1,0%°s! — gi ittt* =0. 
Renaming indices and using giz = gi, we get 
2gem%” + (gex,j + 8jek — 8 jk.oe/X* =0 


and from this 


go semi” + 58) (Bex) + 8 jek — gi, xix* =0. 


Because of g!“ g¢m_ = 6, and thus 9!" gm%” = xX!, (1.1.124) follows. 


Definitio 1.3 A geodesic is a curve y = [a,b] > M that is a critical point of the 
action S, that is, satisfie (1.1.124). 


Briefl interrupting our discussion, we point out that (1.1.124) is the same as 
(1.1.55). In other words, taking up the discussion at the end of Sect. 1.1.1, for the 
Levi-Civita connection, the two definition of a geodesic, being autoparallel as in 
Sect. 1.1.1, or being a critical point of the action functional S as define here, are 
equivalent. In particular, we can also write the geodesic equation invariantly, as in 
(1.1.54), with a slight change of notation: 


Vai =0. (1.1.125) 


We now return to the discussion of geodesics as critical points of S. We say that 
a curve y is parametrized proportionally to arc length if (x, x) = const. 


Lemma 1.4 Each geodesic is parametrized proportionally to arc length. 


Proof For a solution of (1.1.124), 


|: oer 

gas. = aH ii (x(t))x' (x! (t)) 
= gi kx) + gijK'X) + giz nxt x* 
= —(g jk. + Bej.k — Bek, je XK! + ge ex* KOR! 
=0, since gjx,ex x*x) = gop, jx x", 


Consequently, (x, x) = const., and hence the curve is parametrized proportionally 
to arc length. 


As already discussed in Sect. 1.1.1, the next result follows from the Picard— 
Lindeléf theorem about the local existence and uniqueness of solutions of systems 
of ODEs. 
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Lemma 1.5 For each p € M,v € T,M, there exist e > 0 and precisely one geodesic 
c:[0,e] > M 
with c(0) = p and c(0) = v. This geodesic c depends smoothly on p and v. 


We now assume that the metric g on M is Riemannian, even though results corre- 
sponding to those stated below also hold in the case of other signatures, in particular 
for Lorentzian metrics. 

If x(t) is a solution of (1.1.124), so is x (At) for any constant A € R. Denoting the 
geodesic of Lemma 1.5 by cy, 


t 
Cy(t) = cwo( =) fori > 0, t € [0, e]. 


In particular, cy, is define on [0, 5]. 

Since cy depends smoothly on v, and {v € T,M: ||v|| = 1} is compact, there 
exists €9 > 0 with the property that for ||v|| = 1, cy is define at least on [0, eo]. 
Therefore, for any w € T,,M with ||w|| < €0, cw is define at least on [0, 1]. Thus, 


as in (1.1.56): 
Definitio 1.4 Let p<¢M, Vp:={v €TpM : cy is define on [0, 1]}. 


exp,: Vp> M, 
(1.1.126) 
UE> Cy(1) 


is called the exponential map of M at p. 


One observes that the derivative of the exponential map exp, at 0 € TM is the 
identity. Therefore, with the help of the inverse function theorem, one checks that 
the exponential map exp p Maps a neighborhood of 0 € T,,M diffeomorphically onto 
aneighborhood of p € M. Since T, M is a vector space isomorphic to IR? (on which 
we choose a Euclidean orthonormal basis), we can consider the local inverse exp, 
as definin local coordinates in a neighborhood of p. These local coordinates are 
called normal coordinates with center p. In these coordinates, a basis of T, M that is 
orthonormal with respect to the Riemannian metric g is identifie with a Euclidean 
orthonormal basis of R@. This is the firs part of the next lemma: 


Lemma 1.6 /n normal coordinates, the metric satisfie 
gij(0) = dij, (1.1.127) 
T,(0) =0 (andalso gij,x4(0)=9) foralli, j,k. (1.1.128) 


Proof (1.1.127) follows from the fact that the above identificatio ®:7T,M = R¢ 
maps an orthonormal basis of T,, M w.r.t. the metric g (that is, a basis e),...eq with 
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(e;, €;) = 6;; onto an orthonormal basis of R@. For (1.1.128), in normal coordinates, 
the straight lines through the origin of R@ are geodesic, as the line tv, t € R, v € R¢, 
is mapped onto c;,(1) = cy(t), where c,(t) is the geodesic, parametrized by arc 
length, with c,(0) = v. 

Inserting now x(t) = fv into the geodesic equation (1.1.124), we obtain, using 
X(t) = 0, 

ri,(tv)v/vk =0, fori=l,...,d. 
In particular at 0, i.e., for t = 0, 
ri,(0)v/v* =0 forallve R*, i=1,...,d. 


i 


Using the symmetry T° ‘ pal; ? this implies 
ri, (0) =0 
for all i and also for all 2, m. By definitio of igs at 0 € R@, we obtain 
(giant 8ke,j — &jke) =9 
for all free indices, hence also 
Sjm,k + 8km,j — &jk,m = 9. 
Permuting the indices yields 


&kj,m + &mj,k — 8km,j = 0, 


which we add to obtain, for all indices, 


8 jm,k (0) =0. 


This is a very useful result. When one has to check tensor equations, one can 
do this in arbitrary coordinates because by the definitio of a tensor, results are 
coordinate independent. Now, it is often much easier to check such identities in 
normal coordinates at the point under consideration, making use of the vanishing of 
all firs derivatives of the metric and all Christoffel symbols. We shall often employ 
this strategy in the sequel. 

In fact, we can even achieve a little more: Let c(s) : (—a,a) — M be a geodesic 
parametrized by arclength, that is, (c(s), c(s)) = 1 for —a < s <a (see Lemma 1.4). 
Let v!(0),..., v“(0) be an orthonormal basis of T.(0)M with v! = é(0), and let 
v(t) € T-)M be the parallel transport of v! (0) along the geodesic c(s). We defin 
coordinates by mapping (x!,..., x“) in some neighborhood of 0 € R?@ to 


(c(x!), expe gy (x7v? (x!) + $x404(x!))). (1.1.129) 
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Lemma 1.7 The coordinates just described satisfy 


GG Cah), (1.1.130) 
PG 0.0) =, (11.131) 
(and also gij,¢(',0,...,.0) = 0) (1.1.132) 


for all —a < x! 


<a,i, j,k. 

Proof By Lemma 1.4, g1;(x!,0,...,0) is constant, in fact = 1 by our ar- 
clength assumption, as a function of x, Therefore, also gir", 0,...,0) =0. 
Moreover, since the Levi-Civita connection V respects the metric (see (1.1.62)), 
Ble 0,200) 

= (v!(x!), v/(x!)) = je for the other values of j,k. Therefore, also 


gjzi(x',0,...,0)=0 forall j,k. (1.1.133) 


We continue to evaluate all expressions at (x!,0,...,0). All rays tv for v in the 
span of v?,..., v/ are mapped to geodesics, because the exponential map is applied 
to them. So, we obtain, as in the proof of Lemma 1.6, that 


T3,(x',0,...,0)=0 


for i = 2,...,d and all @,m. By definitio of Tip we obtain ge (giek + 


8ke,j — Sjk.e) = 0 at (x',0,...,0) € R® for all free indices, hence also Sjmk + 
8km,j — &jkzm =O form =2,...,d. Permuting the indices to get gxjm + 8mj,k — 
8km,j = 0, adding these relations and combining them with (1.1.133) finall yields 
Bape a0) ...,0) =0 for all indices. 


11.5 Curvature 


We now want to discuss the curvature tensor R of the Levi-Civita connection V. We 
recall (1.1.51): 


R(X, Y)Z=VxVyZ — VyVxZ — Vix,yZ. (1.1.134) 


In local coordinates (cf. (1.1.52)), 


a a\a a) 
r( ) RE (1.1.135) 


axt’ axt Jax! “ axk 


We put 
Recij = 8kmRGij> (1.1.136) 
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i.e. 


nr) a0 
Rkeij =(a( =. sot) aoe wa} (1.1.137) 
There exist different sign conventions for the curvature tensor in the literature. We 
have adopted here a convention that hopefully minimizes conflic between them. 
As a consequence, the indices k and / appear in different orders at the two sides 
of (1.1.137). 
The curvature tensor satisfie the following symmetries: 


R(X, Y)Z=—-R(Y,X)Z, ie. Reeij =—Recji (1.1.138) 
for vector field X,Y, Z, W. 
R(X, Y)Z+ RV, Z)X + R(Z, X)Y =0, (1.1.139) 
or with indices 


Ryeij + Reige + Rejei =9 (1.1.140) 
(the firs Bianchi identity). 


(R(X, Y)Z, W) = —(R(X, Y)W, Z), (1.1.141) 
with indices 
Rxeij = —Rexij- (1.1.142) 
(R(X, Y)Z, W) = (R(Z, W)X,Y), (1.1.143) 
with indices 
Rreij = Rijxe. (1.1.144) 
7] ) C) 
Buh Reeii + ak Renij + Bue Baki =0 (1.1.145) 


(the second Bianchi identity). In order to practice tensor calculus, we give a proof 
of (1.1.145) in local coordinates. We recall (1.1.53): 


a a 
—-— rege rm rere (1.1.146) 


ko 
Rit =a jl ax/ im” jl jm il’ 


Since all expressions are tensors, we may choose normal coordinates around the 
point xo under consideration, i.e., for all indices 
gij (X0) = 5ij, 8ij,k(X0) =0 =F, (x0) (1.1.147) 


(1.1.146) then becomes 


1 
Rxeij = 5 Biksti + S0k,ij — Sje.ki — Sik,ej — Sek,ij + Sick) 
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1 
— 5 (Bik.ti + Bib.kj — Bjlki — Sik,ej)s (1.1.148) 


and also, differentiating (1.1.146) and using once more the vanishing of all terms 
containing firs derivatives of g;; at xo, 


1 
Rkeij,a = 3 (Bik.tih + 8it.kjh — &je.kih — Bik,ejh)- (1.1.149) 


This yields the second Bianchi identity: 


1 
Reeijn + Renij,k + Rokij.e = 5 Bik tin + Sit.kjh — &je,kih — Sik,ejh 


+ 8jtnik + 8ih.ejk — Sjh tik — Silshjk 
+ Bin.kie + Bik,nje — Bjk,nie — Bih,kje) 


=0. 


The sectional curvature of the plane spanned by the (linearly independent) tangent 


vectors X =f, Y= ni € T,M is define as 
K(X AY) := (R(X, Y)Y, X) ——., 1.1.150 
( ):= (R(X, Y) Xa yp ( ) 
(IX A Y|? = (X, X)(Y, Y) — (X, Y)?), with indices 
K(XAY)= Rijxeé’ ni Ek! _ Rijxegi nl éky! 
gikgje(ElEk nin’ —E'Einkn®) (ging je — ea 
1.1.151) 


The Ricci curvature in the direction X = &! a € T,M is define as the average of 
the sectional curvatures of all planes in 7; M containing X, 


a\ a 
Ric(X, X) = 2/4 R{ x, —- )—, X}, 1.1.152 
ic(X, X) =g (r( a) ( ) 


and the Ricci tensor is thus the contraction of the curvature tensor, 
Rix = Rijee = Rig. (1.1.153) 
(1.1.144) implies the symmetry 
Rizk = Rui. (1.1.154) 
The scalar curvature is the contraction of the Ricci curvature, 
R=" R= Ri. (1.1.155) 
For d = dim M = 2, the curvature tensor is determined by the scalar curvature: 


Rijxe = R( ike je — Bij ke)- (1.1.156) 


1.1 Riemannian and Lorentzian Manifolds 29 


For d = 3, the curvature tensor is determined by the Ricci tensor. For d > 3, the part 
of the curvature tensor not yet determined by the Ricci tensor is given by the Weyl 
tensor 


2 
Wijke = Rijke + Ta Bit Rai — gikRej + gjeRei — gjeRki) 


+ @— Dada Bsik8ei — 8it8kj)- (1.1.157) 


1.1.6 Principles of General Relativity 


General relativity describes the physical force of gravity and its relation with the 
structure of space-time. The fundamental physical insight behind the theory of gen- 
eral relativity is that the effects of acceleration cannot be distinguished from those 
of gravity. The presence of matter changes the geometry of space, and acceleration 
is experienced in relation to that geometry. In particular, the geometry of space and 
time is dynamically determined by the physical laws, and in contrast to other phys- 
ical theories, is thus not assumed as independently given. These physical laws in 
turn are deduced from symmetry principles, more precisely from the principle of 
general covariance, that is, that the physics should be independent of its coordinate 
description. For this, Riemannian geometry has developed the appropriate formal 
tools. 
Let M be a Lorentz manifold with local coordinates (x°, x!, x”, x3) and metric 


We recall the Christoffel symbols 


1 
Vey = 58°" (8p5.y + &y5,B = &By,d)s 


and those objects from which the essential invariants of a metric come, that is, the 
curvature tensor Revs = B3,y - Mby.3 + Dy Vs - Ins Ty and its contractions, 
the Ricci tensor (1.1.154), Rog = ess and the scalar curvature (1.1.156), R = 
& a Rap . 

The Einstein fiel equations couple the metric ggg of the underlying differen- 
tiable manifold with the matter and field on that manifold. These equations involve 
the Ricci curvature and are 


1 
RP — Sg Raat, (1.1.158) 


Here, k = ae where g is the gravitational constant. (T°), p is the energy— 


momentum tensor. It describes the matter and field present. When T is given, 
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the Einstein equations then determine the metric of space-time.” The presence of 
a nonvanishing energy-momentum tensor in the fiel equations makes space-time 
curved. The curvature in turn leads to gravity. (1.1.158) is equivalent to 


Rap ~ 58ap R = *Tap. (1.1.159) 

Taking the trace in (1.1.159) leads to R-— 2R = xT, that is, 
R=-—xT. (1.1.160) 
(T = g% Tyg; note that the dimension of M is 4.) Using (1.1.160), (1.1.159) be- 


comes equivalent to 


1 
Rap = «(Tua ~ 58687). (1.1.161) 
In the special case where (Tyg) = 0, that is, when neither matter nor field are 
present, (1.1.161) becomes 
Rug = 0, (1.1.162) 


i.e., the Ricci curvature of M vanishes. 

Hilbert discovered that the Einstein fiel equations can be derived from a varia- 
tional principle. In fact, they are the Euler-Lagrange equations for the action func- 
tional 


Lo(e)= | RY=ede = | RdVoly (x), (1.1.163) 
M M 


called the Einstein—Hilbert functional. To see this, we consider a family 


Bap = Sap + thap 


of metrics with (hag) having compact support if M is not compact itself. Quantities 
obtained from the metric g*, B will always carry a superscript t; for example, 

Rig 
is the Ricci tensor of gi... We also put 


d 
58a = 77 Sab) j1=0 = hog, 


d 
5Rop = a7 (Rab |r =0° etc. 


5Classically, the topology of M is assumed f xed. However, it turns out that the equations may 
lead to space-time singularities, like black holes, which will then affect the underlying topology. 
Such singularities can occur and are sometimes even inevitable, even if suitable and physically 
natural restrictions are imposed on the energy-momentum tensor, like nonnegativity. We do not 
pursue that issue here, however, but refer to [56]. There, also the cosmological implications of 
such singularities are discussed. 
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Finally, we shall use the abbreviation 
Vi=v—8. 


We then have 
d 
TLo(s')ia0= | a(Ry) dx. (11.164) 
dt M 

Now 


5(Ry) = 5(g*? Rapy) = 8% ySRap + Rapd(g"P y). 
We now claim that 
ap : 1 a a 
go’ SRop =divV | =——(yV") (1.1.165) 
y ax® 
for the vector fiel V with components 
VY = p57, — gf SI, (1.1.166) 
Proof of (1.1.165): Let p € M. We introduce normal coordinates near p; thus, at p, 
the metric tensor is diagonal and 
Sop,y(p)=9 and Tey (P) =0 foralla, B,y. 


In particular, at p 


a 
aye =0 foralla. 
In these coordinates, (1.1.165) then follows from the definitio of the Ricci ten- 
sor. While the Christoffel symbols yee as the components of a connection, do not 
transform tensorially, the 6P By do transform tensorially as derivatives, that is, as 
infinitesima differences of connections. The right-hand side of (1.1.165) is thus 
a tensor, and so is the left-hand side. The equality of two tensors can be checked in 
arbitrary coordinates. Since we have just verifie (1.1.165) in normal coordinates, 
(1.1.165) then also holds in arbitrary coordinates, and we have completed its proof. 
We now get 


i (Ry) = f at yaRapde + | Rapd(g"*y)ds 
M 
= [civvy ax + f Repdte"*yax 


= : Rapd(g y) dx (1.1.167) 


by Gauss’s theorem, since V has compact support. 
Now 


ioe 1 
iy=sy 'Sdet(gag) = 578 58ap 
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and moreover 
5g"? = —g” gh¥5g,,5 (from g gg, = 6%) 


and therefore 
1 
sen =r(5 gh gv? = 6° bays (1.1.168) 


(1.1.164), (1.1.167), (1.1.168) imply 


1 
sLo= [ (sei? - Re) yBeap dx =0. (1.1.169) 
M\2 
If this holds for all variations 6gqg with compact support, we have 
1 
eae ea, (1.1.170) 
which implies, as in the derivation of (1.1.162), that 


Rop = 0. (1.1.171) 


Einstein also tentatively introduced a cosmological constant A that has the effect of 
changing the Einstein—Hilbert functional (1.1.163) to 


La(g) =| (R — 2A)./—g dx (1.1.172) 
M 
and the Einstein fiel equations (1.1.158) to 
1 
a a i (1.1.173) 


While a nontrivial cosmological constant is presently appearing in some cosmolog- 
ical models, we put it to 0 for our present discussion. It is straightforward, however, 
to include a nontrivial A in the subsequent formulas. 

In the presence of some matter field $, we assume a Lagrangian 


Ly -|/ F(g,, V8¢)/—g dx (1.1.174) 
M 


depending on the field and their covariant derivatives w.r.t. the Levi-Civita connec- 
tion, as well as possibly also directly on the metric g. When we consider a variation 
58ap Of the metric that does not change the fields we put 


1 
sLi= >| T? Sgupdx. (1.1.175) 
2 Ju 


In other words, the energy—momentum tensor is define as the variation of the matter 
Lagrangian w.r.t. the metric. In order to fully justify (1.1.175), we need to observe 
that all the variations of all metric dependent terms in L are proportional to dgqg. 
For the volume form, this has been verifie in (1.1.168). The covariant derivative V% 
occurring in (1.1.175) also depends on the metric (see (1.1.60)). One easily com- 
putes, for example in normal coordinates, that the variation 51°%, of the Christoffel 
symbol is proportional to a combination of covariant derivatives of 5gg, and that 
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the covariant derivatives can then be integrated away by parts in the computation of 
5L,. When one then considers the full Lagrangian 


Li=Lo+«li, (1.1.176) 


with « as a coupling constant, we thus obtain from (1.1.169) and (1.1.175), for 
variations dgqg of the metric, 


1 
a= | (se? R- RY +67" \ybgapdx. (1.1.177) 
M 


Thus, when 6L = 0 for all variations of the metric, we obtain (1.1.158). 

For a more extended discussion of this variational principle, we refer to the pre- 
sentation in [81] or [56]. 

Finally, we mention the so-called semiclassical Einstein equations 


1 i 
Rap — 5 8apR = «(v|Top|W) (1.1.178) 


where the energy momentum tensor Tyg in (1.1.159) is replaced by the expectation 
value of the energy-momentum operator with respect to some quantum state w. This 
quantum state in turn depends on the metric g through the Schrédinger equation. 
Here, we are invoking concepts that fin their natural place in the second part of 
this book. Equation (1.1.178) arises in the context of quantum field on an external 
space-time. The coupling of a quantum system to a classical one in (1.1.178) leads 
to questions of consistency which we do not enter here. We refer to the discussion 
in [74]. 

Variational principles will be taken up in more generality below in Sect. 2.3.1, 
and in Sect. 2.4, the energy-momentum tensor will appear again. Also, we shall see 
there that a consequence of Noether’s theorem (see Sect. 2.3.2) is that under the 
fundamental assumption of general relativity, namely invariance of L under coordi- 
nate transformations—that is, diffeomorphism invariance—the energy-momentum 
tensor is divergence free. 
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1.2.1 Vector and Principal Bundles 


Let M be a differentiable manifold. In this section, we present the basic aspects of 
the theory of vector and principal bundles. We point out that we have already studied 
one particular vector bundle over M in Sect. 1.1.1, its tangent bundle TM. 
A fibe bundle (or simply, a bundle) over M consists of a total space E, a fibe 

F (both of them also differentiable manifolds), and a projection x : E — M such 
that each x € M has a neighborhood U for which E\y = a~!(U) is diffeomorphic 
to U x F such that the fiber are preserved. This means that there exists a diffeo- 
morphism 


g:n (UU) >UxF 
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with 
T=plog. (1.2.1) 


(p,:U x F > U is the projection onto the firs factor.) 
g is called a local trivialization of the bundle over U. Let {U,} be an open cov- 
ering of M with local trivializations {g,}. If 


Ug Ug #9, 
we obtain transition maps 
Ppa: Ue 1 Ug — Diff(F) (= group of diffeomorphisms of F’) 
via 
BO Py (x, V) = (X, Ppa(X)v). (1.2.2) 


Omitting the base point, which is fi ed by (1.2.1), from our notation, we shall usu- 
ally simply write 


Ppa = HB Py: 
We have 
forx€ Ug: Qaa(x) =idr, (1.2.3) 
forx €UygNUg: Gap(*)Ppalx) = ide, (1.2.4) 
for x EUgN UBNUy: Gay (X)Gyp(X)Ppa(x) = ide. (1.2.5) 


E can be reconstructed from its transition maps: 


E=|[U.x F/~ (1.2.6) 


with 
(x,v)~Q,w):iex=y and w= ¢gq(x)v 
(x € Ug, y € Ug,v, we F). 


When we have some (differentiable) fy : Uy — Diff(F’), we can replace the trivial- 
ization gy over Uy by 


C= Ta OUas (1.2.7) 
and conversely, we can obtain any trivialization g/, over Uy, in this manner via 
for = Py Va" 
(gy : assigns to each x the diffeomorphism inverse to gy (x)). (1.2.8) 


If fa, fg are as above, the transition maps change according to 


-l1_ 


= _ 
Vga = 0p 2% = fp°%pa° fa (1.2.9) 
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The special case where all transition maps take their values in an Abelian subgroup 
A of Diff(F’) yields some additional structure: The transition maps {~gq} then defin 
a Cech cocycle on M with values in A, because (1.2.4) and (1.2.5) imply 


d({ppa}) =9 


for the boundary operator 6. By (1.2.9), two such cocycles {gga} and {Peat defin 
-1 
a 


the same bundle if {g,, 0 Peas is a coboundary. Thus, in this case, we can consider 


a bundle as a cohomology class in H!(M, A).° 
A section of E is a smooth map 


s:iM>E 
satisfying 
mos =id. 


We denote the space of sections by C°(E) or '(E). 

For our purposes, we shall only need two special (closely related) types of fibe 
bundles. The fibe F will be either a vector space V or a Lie group G. The important 
general principle here is to require that the transition maps respect the corresponding 
structure. Thus, they are not allowed to assume arbitrary values in Diff(F’), but only 
in some fi ed Lie group G. G is called the structure group of the bundle. 

According to this principle, the fibe of a vector bundle is a real or complex 
vector space V of some real dimension n, and the structure group is G/(n, R) or 
some subgroup. A bundle whose fibe is a Lie group G is called a principal bundle, 
and the total space is denoted by P. The structure group is G or some subgroup, and 
it operates by left multiplication on the fibe G. Right multiplication on G induces 
a right action of G on P via local trivializations: 


PxG—-P, (x,g)*h=(1, gh) forp=(x,g)€P, 


with the composition rule (p * g)h = p * gh. This action is free, that is, p * g = 
Pp = g =e (neutral element). The projection z : P > M is obtained by simply 
identifying x € M with an orbit of this action, that is, 


z:P—>P/G=M. 


The groups Gi(n, R), O(n), SO(n), U(n) and SU(n) will be the ones of interest 
for us. Acting as linear groups on a vector space, they preserve linear, Euclidean, or 
Hermitian structures. For example, a Euclidean structure, that is, a (positive definite 

scalar product, is an additional structure on a vector space. According to the general 
principle, if we have such a structure on our fibe , it has to be respected by the tran- 
sition maps. As before, this restricts the transformations permitted. In our example, 
we thus allow only O(n) in place of G/(n, R). Such a restriction of the admissi- 
ble transformations by imposing an additional structure that has to be preserved is 
called a reduction of the structure group. 


We assume here that M is connected; otherwise, in place of A itself, we should utilize the locally 
constant sheaf of A. 
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Principal and vector bundles are closely related. Let P — M be a principal bun- 
dle with fibe G, and let the vector space V carry a representation of G. We then 
construct a vector bundle E with fibe V using the following free right action of G 
on PX V: 


PxVxGo>PxV, 
(p,v) * 8 =(p*g,g |v). 
The projection 


PxV-P->M 


is invariant under this action, and 
E:=PxgV:=(PxV)/G>M 
is a vector bundle with fibe 
GxgV=(Gx V)/G=V 


and structure group G. Via the left action of G on V, the transition maps for P yield 
transition maps for E. Conversely, if we have a vector bundle with structure group 
G, we construct a G-principal bundle P by 


[ [Ue x G/~ 
a 
with 


(Xa, 8a) ~ (XB, 8p) <> Xq =XgiINUYNUBZ and gg = GfalX)8a- 


(Here, {Uy} is a local trivialization of E with transition maps gq; these transition 
maps are in Gi(n, R). Since the elements gq are in the structure group G which 
is assumed to be a linear group, that is, a subgroup of G/(n, R), we can form the 
product ~gq(x)ga.) P can be viewed as the bundle of admissible bases of E. In 
a local trivialization, each fibe of E is identifie with R” or C”, and each admissible 
base is represented by a matrix with coefficient in R or C. The transition maps then 
effect a base change. In each local trivialization, the action of G on P is given by 
matrix multiplication. 

All standard operations on vector spaces extend to vector bundles. If we have 
a vector bundle E with fibe V, over x, we can form the dual bundle E* with fibe 
the dual space V* of V,. Applying this construction to the tangent bundle TM 
yields the cotangent bundle 7* M. If E, and E> are vector bundles, we can form the 
bundles E) @ E2, E; @ Ez and E| A E2 by performing the corresponding operations 
on the fibers In particular, from the cotangent bundle T7* M, we obtain the bundle 
A? (M) introduced in (1.1.23), whose sections are the exterior p-forms. 
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1.2.2 Covariant Derivatives 


Let E be a vector bundle over M. We may view E as a family of vector spaces 
parametrized by M. A local trivialization g over U identifie the fiber over U with 
each other. Changing the local trivialization then also changes this identificatio of 
the fibers The identificatio thus depends on the choice of a local trivialization and 
is therefore not canonical. Hence, while we can decide whether a section of E is 
differentiable because all transition maps depend differentiably on x and therefore 
do not affect the differentiability of a section in some local trivialization, there is 
no canonical way to specify the value of its derivative. In particular, we do not have 
a criterion for a section being constant along a curve in M. 

Therefore, in order to be able to differentiate a section, we need to introduce and 
specify an additional structure on E, a so-called covariant derivative or connection. 
We point out that this includes and generalizes the concept of a covariant derivative 
developed in Sect. 1.1.1 for the tangent bundle. 

A covariant derivative is an operator 


D: T(E) > T(E) ®c~(my) T(T*M) 
with the following properties: For 0 € T(E), V € Tx M, we write 
Do(V) =: Dyo 
and require (for all x € M): 
(i) D is tensorial in V: 
Dyzwo =Dyo+Dwo VV,WeET,M, o € T(E), 
Drvo=fDyo VVET,M, f €C™(M), o ET (E£). 


(Remark: It does not really make sense to multiply an element V € 7, M by 
a function f € C°(M). What the preceding rule means is that when we take 
a section V € (TM) of the tangent bundle, the value (Dyo)(x) depends only 
on the value of V at the point x, but not on the values at other points. This is 
not so for o as rule (iii) shows.) 
(ii) D is linear ino: 
Dy(o+1t)=Dyo+Dyt VVET,M, o,t ET (E). 
(iii) D satisfie the product rule: 
Dy(fo)=V(f)o+fDyo YWeET.M, feC*(M), o€T (BZ). 
An example, which is not really typical, but in a certain sense a local model, is the 
trivial bundle M x R over M, where we can put 


Dyo :=do(V)=V(o) 


to obtain a covariant derivative. In the general case, let y be a trivialization of E 
over U, 


Ev =UxR" (<9(7!(U))). 
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Via this local identification a base of R” yields a base 1), ..., 4n of sections of E\y. 
Any section o can then be written over U as 
o(x) =a" (x) Ux (x). 
Then 
Do = (da*) ux + a* Dug. (1.2.10) 
Dup = Al j. Geane) 


Here, for each x, A(x) = (AL(2))  jo4,.. 
ment of gl(n, R) ® TM. In symbols, 


Ae€T(gl(n, R) @ T* My). 


isa T; M-valued matrix, that is, an ele- 


In our trivialization, we write this as 
D=d-+A. (1.2.12) 


We now wish to determine the transformation behavior of A under a change of the 
local trivialization. Let {U,} be an open covering of M that yields a local trivializa- 
tion with transition maps 


Ppa? Ue Ug > Gl(n, R). 


D define a T*M-valued matrix Ag on Uy. Let the section jz be represented by 
Ma on U,. A Greek index a here is not a coordinate index, but refers to the chosen 
covering {U,}. Thus 


Mp =Ppalka on Ug Uz. 
This implies 
(pad + Aw)Ma = (d+ Apg)eg onUg Uz. (1.2.13) 


On the left-hand side, we have firs computed Dy in the local trivialization deter- 
mined by U, and then transformed the result into the local trivialization determined 
by Ug, while on the right-hand side, we have directly expressed Dy. in the latter. 
We conclude 


Aa = G54 ¥pa + Ppy ABP Ba- (1.2.14) 


We have thus found the transformation behavior. Ag does not transform as a ten- 
sor, because of the term Ppa Ppa The difference of two connections, however, 
does transform as a tensor. The space of all connections on a given vector bundle 
is therefore an affin space. The difference of two connections is a gl(n, R)-valued 
1-form. 

Having a connection D on a vector bundle E, it is now our aim to extend D 
to associated bundles, requiring suitable compatibility conditions. We start with the 
dual bundle E*. Let 


(,):E@E*>R 
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be the bilinear pairing between FE and E*. The base dual to some base [1], ..., Ly 
of E is denoted by Cae ..., P*", Le, 
(ui, w*4) = 8! (1.2.15) 
We then defin the connection D* on E* by requiring 
d(w, v*) = (Du, v) + (uw, D*v) (1.2.16) 


for all « € P(E), v € P(E*). In our above notation 
D=d+A, D*=d+A*. (1.2.17) 
From (1.2.15) (cf. (1.2.11)) we then compute 
0 =d(ui, A!) = (AF un, we) + Mi, APT a") 


= Al + A, 


A*=—A. (1.2.18) 


We now construct a connection on a product bundle from connections on the fac- 
tors. If E; and E2 are vector bundles over M with connections D,, D2, we obtain 
aconnection D on E := E) ® E2 by 


D(1 @ 2) = Dip) ® w2+ 1 ® D212 
(4; €T(E;), $= 1,2). (1.2.19) 


We apply this construction to End(E) = E @E * to obtain a connection that is again 
denoted by D. For a section 0 = Oj Mi @ p*!, we then have 


Doi pi @ w*!) = doin; ® uw! +o At ® we! — oi Alu; @ w** 
=do +[A,o]. (1.2.20) 


Thus, the connection induced on End(£) operates via the Lie bracket. In a slightly 
different interpretation, we can view a connection D as a map 


D:V(E) > T(£)@Q2) (mM). 
Using the notation 
Q?(E) := T(E) @ Q?(M), 
we extend D to a map 
D:2?(E) > Q?+1(B) 
by 


D(uw) = Du Nw+ udw (1.2.21) 
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(where pw € T(E), w € 2?(M), and we have written wo in place of uw @c~ (mM) .’) 
The curvature of a connection D is now define as 
F := D?:2°(E) > Q°(E). 
D is called fla if 
F=0. 
Since the exterior derivative d satisfie (1.1.37), Le., 
dod=0 


we obtain the de Rham complex 


2402404... (QP =2°(M)). 


The sequence 
2 (FE) 3 ale) 2 o£) 4... 


however, is not necessarily a complex, since in general F 4 0. For w € T(E) 
(= Q°(E)), we compute 


F(u) = (d+ A)o(d+A)u 
= (d+ A)(dut Ap) 
=(dA)ju—Adu+Adu+AdAApm 
(the minus sign arises because A takes values in 1-forms), that is, 
F=dA+AAA. (1.2.22) 
If we write 
A= Aj dx! 
in local coordinates, with A; € P(gl(n, R)) = '(End(£)), (1.2.22) becomes 
_ 1 (4 dAj 


~ 2 ax! axl 
F is a map from 2°(E) to 22(E), ie., 


+[Ai, Ail) dx' Adx!. (1.2.23) 


F €97(E) @ (Q°(E)) = 2? (End(E)). 
Therefore, according to our rules (1.2.20) and (1.2.21), 
DF =dF +[A, F] 
=dANA—AAdA+[A,dA+AAA] (by (1.2.22)) 
=dAX\A—AAdA+AAdA—dAAA+4+[A, AAA] 


7We leave it to the reader to (easily) verify that (1.2.21) is well defined even though the decompo- 
sition 4 ®c~(y) @ is not canonical. 
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=[A,AAA] 

=[A;dx!, Ajdx/ A Agdx*] 

= Aj AjAg(dx! Adx/ A dx* — dx! A dx* a dx! 
— dx! A dx* ~ dxi +.dx* ~ dxi A dx') 

=0. 


We thus obtain the Bianchi identity: 


Theorem 1.1 The curvature of a connection D satisfie 


DF =0. (1.2.24) 


The Bianchi identity can also be derived in a conceptually more interesting man- 
ner from the equivariance of the curvature (f* Fp = Fy+p, Fp = curvature of D) 
under bundle automorphisms f, that is, diffeomorphisms commuting with the group 
action (cf. [95]). 

Using the notation of (1.2.13), we now wish to determine the transformation 
behavior of the curvature F of a connection D. From (1.2.14), 


Fy = dAg + Ag A Aq 
= d(vj4d ¢pa) + dG pg Ap Opa) 
+ Pad Ppa A Ppy db pa + Ypyd Ppa A pq Ap Ppa 
+ DpyAp A db pa + Ppy Ap \ Apopa- 
Because of 
A (Ppa) = — Ppa PboPpe 
the derivatives of ggq cancel, and we obtain 


Fos Pea Fe Qpa- (1.2.25) 


Thus, F transforms as a tensor, in contrast to A. 


1.2.3, Reduction of the Structure Group. 
The Yang-Mills Functional 


We now wish to implement the general principle formulated above that additional 
structures on the fiber of a bundle lead to restrictions on the admissible transfor- 
mations. In the previous section, G/(n, R) was the structure group of our vector 
bundle. This reflecte the fact that we only had a linear (vector space) structure on 
our fibers but nothing else. We shall now consider vector spaces with a structure 
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group G C Gi(n, R). The group G will then be interpreted as the invariance group 
of some structure on the fibers Let g be the Lie algebra of G. For a connection 
D on the vector bundle E with fibe IR”, we then require compatibility with the 
G-structure. To make this more precise, we consider local trivializations 


g:n (UU) >UxR" 


of E whose transition functions preserve the G-structure, that is, ones that trans- 
form G-bases 11, ..., 4n (meaning that the matrix with the columns j11,..., (tn is 
contained in G) into G-bases. Linear algebra (Gram-Schmidt) tells us that we can 
always construct such trivializations. In such a trivialization, we also require of 


D=d+A 
that 
AET(g@T*My). (1.2.26) 


Let us consider some examples. G = O(n) means that each fibe of E possesses 
a Euclidean scalar product (-, -). Via a corresponding local trivialization, for each 
x € U, we then obtain an orthonormal base e)(x),..., én (x) of the fibe V, over x 
depending smoothly on x, namely yg! (x, €],..-,€n), Where €1,..., €n 1s an ortho- 
normal base of R” w.r.t. the standard Euclidean scalar product. We then want that 
the Leibniz rule holds, i.e., 


d(o,t) = (Do,t) + (a, Dt), (1.2.27) 
that is, we require that (-, -) is covariantly constant. This implies in particular 
O=d(ej,e;)=(A ej,e;) + (ej, A ej), (1.2.28) 


that is, A is skew symmetric, A € o(n). A connection D satisfying the Leibniz rule 
is called a metric connection. 

Analogously, for G = U(n) we have a Hermitian product on the fibers and the 
corresponding Leibniz rule implies 


Aeu(n). (1.2.29) 


We then speak of a Hermitian connection. 

AdE is define to be the bundle with fiber (AdE), C End(V,) consisting of 
those endomorphisms of V,,. that are contained in G. AdE = P xq qg, where P is the 
associated principal bundle G acts on g by the adjoint representation. Analogously, 
Aut(£) is the bundle with fibe G, now considered as the automorphism group of 
V,, that is, 


Aut(E) = P xgG, 


where G acts by conjugation. (Thus, Aut(£) is not a principal bundle.) (The reason 
for this action is the compatibility with the action 


PxVxG>PxV, — (p,v)*g=(pg.g7!v), 
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because with (p, h) * g = (pg, g~'hg), we obtain g~'hg(g~!v) = g~! (hv), since 
G acts on V from the left.) Sections of Aut(E) are called gauge transformations, 
and the group of gauge transformations is called the gauge group. 

A section s € P(Aut(£)) operates on a connection D by 


s*D=s!oDos, (1.2.30) 

hence, for uw € T(E) 
s*(D)w=s !D(sp), (1.2.31) 

and, wih D=d+A 
s*(A)=s ldsts As. (1.2.32) 


In our present notation, the transformation rule (1.2.25) for the curvature F of D 
becomes 


s*(F)=s !oFos. (1.2.33) 


Here, we consider F as an element of 2?(AdE) =T'(AdE @ A?T*M), and s acts 
trivially on the factor A?T*M. The induced product on the fiber Ad, ® A?T*M 
that comes from the bundle metric of E and the Riemannian metric of M will be 
denoted by (. ,.). 


Definitio 1.5 Let M be acompact Riemannian manifold with metric g, E a vector 
bundle with a bundle metric over M, D a metric connection on E with curvature 
Fp € Q?(AdE). The Yang-Mills functional applied to D is 


YM(D) = (Fp, Fp)dvole. (1.2.34) 
M 


We now recall from Sect. 1.2.2 that the space of all connections on E is an affin 
space; the difference of two connections is contained in Q!(End£). Therefore, the 
space of all metric connections on E is an affin space as well; the difference of 
two metric connections is contained in Q!(AdE). For deriving the Euler-Lagrange 
equations for the Yang—Mills functional, the variations to consider are therefore 


D+tB_ with Be Q!(AdE). 
For o €T(E) = 2°), 
Fp418(0) =(D+1tB)(D +tB)o 
= D’o +tD(Bo)+tBA Do +1t7(BA B)o 


= (Fp +t(DB)+1?(BA B))o, (1.2.35) 
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as D(Bo) = (DB)o — B A Do. Therefore, 


d d 
—YM(D+tB)j=0 = — F _F = 
7 (D+tB)\t=0 ak D+tB, Fp+tB)|t=0 


= 2 foe. Fp). (1.2.36) 
Using the definitio of D* (1.2.17), this becomes 


dt 


This vanishes for all variations B if 


D* Fp =0. (1.2.37) 


d 
SYM(D +1B)y-0=2 | (B.D*Fo). 


Definitio 1.6 A metric connection D on the vector bundle E with a bundle metric 
over the Riemannian manifold M satisfying (1.2.37) is called a Yang—Mills connec- 
tion. 


In tensor notation, Fp = Fjjdx! A dx/, and we want to express (1.2.37) in local 
coordinates with the normalization g;;(x) = 4;;. In such coordinates, 


: ; OF;; ‘ 
d*(Fijdx! A dx/) = ——* dx/, 
Ox! 


and hence from (1.2.18) 


OF;; 
D* Fp = (- a 1A: Fyl)axt, 
ox! 
The Yang-Mills equation (1.2.37) in local coordinates thus reads 
OF;; 
; + +[Ai,Fij]=0 forj=l,...,d. (1.2.38) 
Xx 


In the preceding, we have define the Yang—Mills functional for metric connec- 
tions, i.e., ones with structure group G = O(n). Obviously, the same construction 
works for other compact structure groups, in particular for U(m) and SU(m). Those 
groups operate on the fiber of complex vector bundles. For a complex vector bun- 
dle, one has the structure group G/(m, C), that is, those of complex linear maps, 
and a Hermitian structure then, as explained, is a reduction of the structure group 
to U(m). We now consider complex vector bundles, as for them, we can defin im- 
portant cohomology classes from the curvature of a connection, the so-called Chern 
classes, as we shall now explain. Thus, E now is a complex vector bundle of Rank 
m, that is, with fibe C”, over the compact manifold M. D is a connection on E 
with curvature 


F=D*:2°>97(E). (1.2.39) 
F satisfie the transformation rule (1.2.25): 


a Ppa FeQap- (1.2.40) 
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Therefore, we can consider F as an element of AdE. Since E is a complex vector 
bundle with structure group gl(m, C), AdE = EndE = Homc(£, E). Thus, 

F eéQ?(AdE), (1.2.41) 


that is, F is a 2-form with values in the endomorphisms of £. Therefore, $F 
(the factor is simply chosen for convenient normalization) has eigenvalues Ax, k = 


1,...,m, which are 2-forms. We then defin exterior forms cj(E) € Q7/(M), 
j=l,...,m,onM via 
m . m 
‘ i 
Maer = dea( So +12) = [ [a + Agt). (1.2.42) 
j=0 an k=1 


From the Bianchi identity (1.2.24), ie., DF = 0, one concludes that dcj(E) = 0 
for all j. Thus, the c;(£) are closed and therefore represent cohomology classes. 
One also verifie that these classes do not depend on the choice of the connection D 
on E. These cohomology classes are called the Chern classes of the complex vector 
bundle E over M. Thus, from an arbitrary Hermitian connection on the bundle E, 
we can compute topological invariants of E and M. 

For j = 1, 2, we get 


c\(E) = —-trF, (1.2.43) 
20 
m—1 1 
c2(E) — Cc) (E) Aci (E) = —s tr(0 A Fo), (1.2.44) 
2m 872 
where 
1 
Fo:= F ——trF-Idg is the trace-free part of F. (1.2.45) 
m 


We now consider a U(m) vector bundle E over a four-dimensional oriented Rie- 
mannian manifold M. We let D be a Hermitian connection on E with curvature 
F = D’. As explained in (1.1.29), (1.1.31), we can decompose Fo into its self-dual 
and antiself-dual components 


Fo= Fj + Fo. (1.2.46) 


We recall (1.1.30), ie., that the exterior product of a self-dual 2-form with an 
antiself-dual one vanishes, and obtain 


tr(Fo A Fo) =tr(Fy A Fy) + tr(Fo A Fy) 


=-|FSP+1F 7? (1.2.47) 


by (1.1.29) (note that the trace is the negative of the Killing form of the Lie alge- 
bra u(m), that is, A - B = —tr(AB), which explains the difference in sign between 
(1.2.47) and (1.1.29)). 
From (1.2.44), we obtain by integration over M 
m— 


(2(4)- 


1 1 
c(E))[M] = —as fri? —|Fo P)Vedx' A---Adx". 
(1.2.48) 


46 1 Geometry 


The Yang—Mills functional then can be decomposed as 


1 
YM(D) = [(Giwer + Fo?) Vas" MeseKRaM? 
m 
M 


1 
= f (Sie r + iat ire?) veae! nd (1.2.49) 
m 
M 


Since tr F' represents the cohomology class —2zic;(£), the cohomology class of 
tr F is fi ed, and 


[irrPveds! a nax! 
M 


becomes minimal if tr F minimizes the L?-norm in this class (tr F therefore has 
to be a harmonic 2-form, see (1.1.115), (1.1.117)). Next, because of the constraint 
(1.2.48), that is, because the difference of the two integrals of Fe and Fy is fi ed 
by the topology of M and E, and therefore, f | Fo | becomes minimal if one of them 
vanishes, i.e., 


Fi =0 or F =0, (1.2.50) 


i.e. if Fo is antiself-dual or self-dual. Which of these two alternatives can hold de- 
pends on the sign of (c2(E) — m—1 1 (E)*)[M]. 

We now assume that the structure group of the complex vector bundle E is re- 
duced to SU(m). Thus, the fibe of AdE is su(m) which is trace-free. Therefore, if 
D is an SU(m) connection, its curvature F € Q?(AdE) satisfie 


trF =0. (1.2.51) 
Consequently, by (1.2.43) 
ci(E) =0, 
and by (1.2.44), (1.2.48) 


c2(E)[M] = — aa fart =(F- Pi/edx' Avi Ads, 
M 


Thus again, the difference of the two parts of the Yang—Mills functional is topolog- 
ically fi ed, and as in (1.2.49), 


ym(D) = f (Ft P +1F Pea! A+++ A dx! 
M 


is therefore minimized if F is (anti)self-dual; again, which of the two possibilities 
can hold depends on the sign of c2(E)[M]. We conclude that: 


Theorem 1.2 For a vector bundle E with structure group SU(m) over a compact 
oriented four-dimensional manifold M, an SU(m) connection D on E yields an 
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absolute minimum for the Yang—Mills functional if its curvature F is self-dual or 
antiself-dual. 


For a systematic presentation of four-dimensional Yang—Mills theory, we refer 
to [31]. 


1.2.4 The Kaluza—Klein Construction 


Here, we take up the discussion of Sect. 1.1.6 and combine it with a bundle construc- 
tion. The idea was firs put forward by Theodor Kaluza in order to unify gravity with 
electromagnetism. Although this was not successful in its original form, the general 
idea is still important and alive today. 

Kaluza’s ansatz was to consider, in place of the Lorentz manifold M, a fibe 
bundle M over M. Kaluza took the real axis R as the fibe . This was then modifie 
by Oscar Klein who chose the fibe S!, that is, the compact Abelian Lie group U(1), 
and this is also what we shall do here. Subsequently, we shall consider more general 
fibers Following the physics literature, we shall always assume that M is a principal 
fibe bundle. 

We obtain a metric 


g=n*g+A@A (1.2.52) 


on M where x : M — M is the projection, g is the Lorentz metric on M, and A is 
the 1-form for some U(1) connection on M. (More precisely, A=m*A where A is 
the connection form on M.) As in Sect. 1.1.6, we take the total scalar curvature as 
our action functional, that is, 


L(g) = [ R/—gdx°.--dxdé, (1.2.53) 
M 


where R is the scalar curvature of g and & is the fibe coordinate. To rewrite this 
functional, we firs give the formulae for the Ricci curvature of g. Let V be a unit 
vector fiel in the fibe direction. Because of the form (1.2.52) of the metric, this 
simply means that V is dual to A. For each tangent vector fiel X on M, we consider 
the horizontal lift X,, determined by 


X= X and a(Xp, V) =0. 
Let F be the curvature form for the connection A, i.e., 
F=dA (anda*F =dA) 


(note that A is a U(1) connection, hence Abelian, and so, here we do not have an 
A A A term in the formula for the curvature). 
We then have, for the Ricci tensor R(., -) of g, 


R(Xn, Yn) = R(X, Y) +2F 0 F(X,Y), (1.2.54) 
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where R(-,-) is the Ricci tensor of M, and in local coordinates with 


F = Fog dx® A dx®, 


we have 
(F 0 Fp = 8”? Fay Fog = —8"° Foy Fs (1.2.55) 
and 
R(X, V) = —d* F(X), (1.2.56) 
R(V,V) = |F/? = FY Fog. (1.2.57) 


In particular, the scalar curvature satisfie 
R=trR(,-)=R-—|F/, (1.2.58) 


where, of course, R is the scalar curvature of g. Upon integration over the fibers 
(1.1.177) hence becomes 


cia) = f (R- IPP) y=Bas, (1.2.59) 


that is, the sum of the Einstein—Hilbert functional of the base and the Yang—Mills 
functional of the fibe . If the Einstein fiel equations for the vacuum hold for such 
a metric on M, then, by (1.1.162), M has to have vanishing Ricci curvature, and then 
by (1.2.57), F has to vanish. Since F is supposed to represent the electromagnetic 
field this does not constitute a desirable physical consequence of this ansatz.® 

We can extend this construction to principal fibe bundles 7 : M — M with com- 
pact non-Abelian structure group G. For that purpose, let g’ be a G-invariant metric 
on the fibe , which we can then extend to all of 7M with the help of a connection 
(given by a 1-form A). Let g again be a metric on the base M. For tangent vectors 
W, Z on M, we put 


&(Z, W) = g(04Z, 1.W) + g'(Z, W) 


(where Z, W denote the projections of Z, W onto the fibers) obtaining a metric 
on M. If U and V are tangential to a fibe , we obtain, with notation analogous to 
that above, 


R(Xh, Yn) = R(X, Y) — 287? 9! (Fay, Fps)X°Y? 


a a 
(x=xSyerS), (1.2.60) 


ox” ox” 


8In an alternative interpretation, one might consider g as consisting of g and A and interpret the 
Euler-Lagrange equations for (1.2.59) as coupled Einstein—Maxwell equations for the metric g 
and the potential A. In that case, the undesired consequence that F has to vanish does not follow, 
but then we have a coupling rather than a unificatio of gravity and electromagnetism. 
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R(Xn, V) = —g'(d* F(X), V) (1.2.61) 


RU, V)=R'(U,V)+ det(g”)? ~ g' (Fop, U)g' (Fog, V). (1.2.62) 
or, B 


The action functional becomes 
L(z) = i (R + R! — |F |?) dvol jy. (1.2.63) 
M 


(Here, R is integrated on the base and the result is multiplied with the volume of the 
fibe , whereas R’ can be integrated on any fibe , by G-invariance, and the result is 
multiplied with the volume of M—assuming M to be compact again.) 

The Einstein fiel equations for the vacuum now no longer require the vanishing 
of F. |F|? has to be constant, however, when those equations hold, and base and 
fibe must have constant scalar curvature. In fact, taking the trace in (1.2.62) yields 
constant scalar curvature in the fibe direction when the fiel equations hold, and 
because the scalar curvature of the metric on the fibe bundle is constant, the scalar 
curvature in the fibe direction also has to be constant. Taking the trace in (1.2.60) 
then yields constant scalar curvature on the base. 


1.3 Tensors and Spinors 


1.3.1 Tensors 


We have already encountered the tangent bundle TM of a manifold M; its dual 
bundle is the cotangent bundle T*M. The fibe of the tangent bundle over p € M is 
the tangent space T, M, and the fibe of the cotangent bundle is the cotangent space 
T*M. 

Pp 


Definitio 1.7 A p times contravariant and q times covariant tensor (field on 
a differentiable manifold M is a section of 


TM®---@®TM@T*M®:---@T*M. (1.3.1) 
ee 
p times q times 


We recall that on a complex manifold, we have the decompositions 
T°M=T'M@T'M, T*M=T*MOT* M, (1.3.2) 


which are invariant under (holomorphic) coordinate changes, and the transformation 
rules (1.1.86), 


azk 


= awit 


. Oz i 
dzi= > dw’, dz 


are 


dw™ (1.3.3) 
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when z = z(w). We can therefore also speak of covariant tensors of type (r,s), 
meaning sections of 


T*M®---@T*M@T* M®-:-@T* M. (1.3.4) 
a ee 
r times s times 


(Contravariant tensors are define analogously, with the tangent bundle in place of 
the cotangent bundle.) 

For simplicity, we now consider the case of complex dimension 1, that is, of 
a Riemann surface, in order not to have to bother with too many indices. The reader 
will surely be able to transfer the subsequent considerations to the case of an arbi- 
trary (finite dimension. We return to the conceptualization of variations described 
in (1.1.22), (1.1.39), (1.1.41) and perform a variation 


zee ztef(z)=:z+6€6z (1.3.5) 


with a holomorphic f. We want to determine the induced variation dq of an (r, s)- 
form, that is, of an object of the type 


Q(z, Z) = w(z, Z)(dz)" (dz)°. (1.3.6) 


Here, r and s are called the conformal weights of w. Analogously to (1.1.41), we 
obtain the induced variation 


5p p2(z, 2) = (r(G. f) + 9(azf) + FO. + Faz) QZ, B). (1.3.7) 
r +s is called the scaling dimension, because for z+ 4z,A ER, 
Q = w(z, Z)(dz) (dZ)8 & ATS w(Az, AZ) (dz) (dZ)°. (1.3.8) 
r —s is called the conformal spin, because for z eiPz, 
Q = wz, D(z)" (d2)8  eO— 9? ew (e7 z, e!9 2) (dz)! (dd. (1.3.9) 


1.3.2 Clifford Algebras and Spinors 


Let V be a vector space of dimension n over a f eld F, which we shall take to be R 
or C in the sequel, equipped with a quadratic form Q: V x V > F. We then form 
the Clifford algebra C/(Q) as the quotient of the tensor algebra D,.) V @---@V 


of V by the two-sided ideal generated by all elements of the form ae 
v@v— Q(v, v). (1.3.10) 
In other words, the product in the Clifford algebra is 
{v, w} :=vw+wv=20(v, w). (1.3.11) 
Let e1,..., é, be a basis of V. This basis then satisfie 


eye; + eje; =2Q0(E;, e;). (1.3.12) 
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The dimension of C/(Q) is 2”, a basis being given by 
C0:=1, Ca,Car- Coy, With l<ay <---<ay <n. (1.3.13) 


We defin the degree of ey, --- eq, to be k. The degree of eo is 0. We let ck(Q) 
be the vector space of elements of C/(Q) of degree k. We also let C/°’(Q) and 
Cl°¢4(Q) denote the space of elements of even and odd degree, resp. We have 


cP(Q)=RorC 


(1.3.14) 
cl'(Q)=V, 
whereas 
CP (Q) =: spin(Q) (1.3.15) 
is a Lie algebra with bracket 
[a, b] := ab — ba. (1.3.16) 
It acts on C/'(Q) = V via 
T(a)v := [a, v] =av — va. (1.3.17) 
(Using (1.3.11), one verifie that for a € C?(V), v € CI'(V), we have av — va € 


cI'(V).) 

The simply connected Lie group with Lie algebra spin(Q) is then denoted by 
Spin(Q) and called the spin group. According to the general theory of represen- 
tations of Lie groups (see e.g. [45]), representations of spin(Q) lift to ones of 
Spin(Q). 

Example 
1. Q=0: This yields the so-called Grassmann algebra with multiplication rule 
DV; + DV; =0, 


for some basis #),..., On. 
2. For F =R, consider the quadratic form Q with 


1 fori=1,...p, 


ej,eé)= i,e;)=0 fori¢ j 
O(e;, ei) af fre eto. Q(e;, ej) ifxj 

for some basis e1,..., én of V. Putting g := n — p, we denote the corresponding 
Clifford algebra by 


Cl(p, q). 


p = 0 yields the Clifford algebra C/(0,) usually considered in Riemannian 
geometry. Of course, for given n, the Clifford algebra C/(p, q)(p + q =n) de- 
pends on the choice of p € {0,...,}. This is no longer so for the complexifica 
tion 


Ch (n):=Cl(p,q)@xrC (p+q=n). 
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In fact, we have 
Ci (m) = C2"*2" for m = 2n, 


CE (m) = C2"*2" @ C2"*2"_— for m =2n +1. 


We defin the Pauli matrices 


ov = (( i a=(\ i) a=(5 2) n= (¢ .) 

0 1)’ 1 O/}’ i 0}? 0 -l1)° 
(1.3.18) 

They form a basis of the space of 2 x 2 Hermitian matrices. We have 


{o;, aj} = 0j0; + ojo = 25; ;00 fori, 7 =1,2,3. (1.3.19) 


(Note the + sign here: {o;, o;} is an anticommutator, not a commutator.) 
The correspondence 


eo t> O09, ej > O71, €2 > 03, e1@2  —I102 
thus yields a two-dimensional representation of C/(2, 0), whereas mapping 
ejb ol, e2 +> 102, €1e2 > —03 
yields one of C/(1, 1) and 
el; b> id}, €2 > 102, e€1e2 > —103 


yields one of C/(0, 2). The representations of C/(2, 0) and C/(1, 1) are both isomor- 
phic to the algebra of real 2 x 2 matrices, whereas that of C/(0, 2) is isomorphic 
to the quaternions H. In particular, for later reference, we emphasize that we have 
displayed here real representations of C/(2,0) and C/(1, 1). 

Looking at C/(2, 0), which will be of particular interest for us, and extending the 
representation to the complexification we make the following observation which 
we will subsequently place in a general context. ie;e2 is represented by o2, and 
it anticommutes with both e; and e2. Therefore, the representation of CP (2,0) = 
spin(2, 0) leaves the eigenspaces of ie;e2 invariant. In contrast, e; and e2, that is, 
the elements of C/' (2,0), interchange them. (In particular, as a representation of 
spin(2, 0), the representation is reducible; the two parts themselves are irreducible, 
however. Here, this is trivial, because they are one-dimensional, but the pattern is 
general.) The eigenvalues of ie;e2 are +1, and its eigenspaces are generated in our 


representation by the vectors 
(7) and ( ) : 
U —l 


eo O00, enti gy €3 > 03 


The correspondence 


yields a two-dimensional representation of C/(3, 0). 
We defin the Dirac matrices 


0_ {90 0 pe 0 Oj Bis St 
v=(5 Ss y aes 0}: for j = 1,2, 3, 
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0 «a 
y S miytyly2y = (8 ae 


where each 0 represents a 2 x 2 block; i.e., the y! are 4 x 4 matrices. The matrix y° 


is Hermitian, while y!, y?, y? are skew Hermitian. (This is expressed in the formula 
yyy? =y"" for w= 0, 1, 2,3.) They satisfy 


{y®, v9} =27 = {y>, y}, 
{yi y/}=-21 for j =1,2,3, 


{y',y}=0 fori ¢k, 


where / is the 4 x 4 identity matrix. Thus, we obtain a four-dimensional represen- 
tation of C7(1, 3) and CI“ (4), called the Dirac representation, by 


ej be yi} fori =1,2,3,4. 


(Note: it might be better to denote the Dirac matrices by y!,..., y+ instead of 
y°,...,y°. Here, however we follow the convention in the physics literature; 
y> will subsequently be denoted by T when we consider arbitrary dimensions.) 
We also consider 
1 
HV _ 7, av 
o ae ae 


where [., .] is an ordinary commutator. (Note: in the physics literature, there is an 
additional factor i in the definitio of 0“”.) 
In the Dirac representation, we have 


; 0 «6 
oi = ic a 2 == 2a hie 6 ) 


1 if (@, j,k) is an even permutation of (1,2, 3) 
@ =j-1 if(,J/,k) is an odd permutation of (1, 2, 3) 
0 otherwise. 


In the Weyl representation, we instead defin 


0 —o F 0 0; 

0_ 0 i pois J am 

y es 0 ). y (2 ae for 7 = 1,2, 3, 
Jes 0: 5923 a 6 COO 

Y =iy yyy =, a: 


In this case, we have 


a _ {a 0 
- a ae = j=-Deni(G ey 


Therefore, the action of the o”” is reducible into two subspaces of (complex) di- 
mension 2 each. Finally, we have the pseudo-Majorana representation, where all y” 
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are purely imaginary: 


We now wish to consider representations of C/(2n, 0) and CI (2n) more abstractly. 
We consider the algebra generated by a basis y1,..., /2n of R2” satisfying 


{Yu Wt = 26uv 


and set 


1 1 
ai= ra +iy2), al = rata —ty2), 


1 : 1 . 
ani = 7 Pn-1 + iy2n), ah = 5 (Y2n—1 —iy2n) 


in R*” @ C. In the physics literature, the a;, at are called fermion annihilation 
and creation operators. We equip C” with the coordinates z! = x! + ix?,...,z”= 
x?2—l 4 ix?" We let AO-”C" be the space of (0, g)-forms, i.e. the vector space of 
differential forms generated by 


dz nw. Adz4, 1<iy <+++<ig<n (dz! =dx! —idx?, etc.) 
We let e(dz/ ) denote the exterior multiplication by dz/ from the left, i.e., 
e(dz/)(dzl A---Adzi4) =dzi Adz n+. Adz, 


sending (0, q)-forms to (0, q + 1)-forms. Likewise, we let udzi ) be the adjoint of 
e(dz/) w.rt. the natural metric on C”; thus 


i(dz/)(dzit A. Adzit) 


- 0 =e if 7 Z{ji,..-, jg}, 
(-1)!ldzit Aw Adzih# A+ Adz) if j= jy. 
We then obtain the desired representation by 
al i e(dz/), 


ap be u(dz*). 
Of course, one verifie that the formulae 
faj,aj}=0,  faf,al}=0, — {a;,a}} = 5) 
are represented by 


{e(dz/), e(dz*)}=0, — {u(dz!), (dz*)}=0, ——{e(dz/), (dz*)} = 5 jx. 
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The space 


n 
S:= ACIC" := B AODEr 
q=0 


on which C/©(2n) thus acts is called spinor space. The elements of S are called 
((complex) Dirac) spinors. Since by (1.3.14), V is a subspace of its Clifford algebra, 
it therefore operates by multiplication on any representation of that Clifford algebra, 
in particular on S. This is called Clifford multiplication. 

The representation S is not irreducible as a representation of spin(2n, 0), how- 
ever. To see this, we consider the “chirality operator” 


T:=ity, “+ Y2n 
(for 2n = 4, one often writes ys in place of T as explained above) 


(with the usual exponential series, we can also write T = exp(iz N), with the “num- 
ber operator” N := )7'_) a\aj). 


(T,yu}=0 forw=l,...,2n, 
r?=1. 


Thus, we may decompose C/©(n) into the eigenspaces C/©(n)* of I for the eigen- 
values +1, and these eigenspaces are interchanged by Clifford multiplication with 
any v € C”\{0}. Thus 


1 
Pe :=-0U4+T 
5 ) 
project onto the eigenspaces of I’, and we get a corresponding decomposition 
S=StOS- 
into “positive and negative chirality spinors” (also called right- and left-handed 
spinors), or “Weyl spinors”. If p — g =0,1,2 mod 8, one may also fin a real 
representation of C/(p,q). The corresponding spinors are called real or Majorana 
spinors. An important example is n = 4, p = 3,q = 1. Likewise, for g — p=0, 1,2 
mod 8, there exist imaginary or pseudo-Majorana spinors. 


The Lie algebra so(n) consists of skew symmetric matrices. It is generated by 
the matrices M’! with coefficient 


(M4) qs = 615) — 8465},. 
They satisfy the commutation rules 
(Mii, Me) = —sik i! + 5k il + gl ik — 5) ik, 


These rules are also satisfie by 


1 
Oj = qr; — Vivi) 
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where yj,..., ¥ are a basis of IR” with {y;, y;} = —6;;. (Note that this differs by 
a factor —5 from the convention employed in the definitio of the Dirac and Weyl 
representations above.) 

Thus 

M 2 F> O7 j 
yields a representation of so(n) on R”; in fact, we may identify so(n) with 
spin(0,). Since spin(0,n) = CP (0,n) we thus get an induced representation of 
so(n) on the spinor space S. This representation, however, does not lift to one of 
SO(n), but only to one of Spin(0, n), the two-sheeted cover of SO(n). 

In the case when n is even, since each oj; is a sum of products of two y;, and 
since Clifford multiplication with each y; interchanges the eigenspaces of S~ of T, 
oj; leaves these eigenspaces invariant. 

To summarize: We have established an isomorphism so(n) <— spin(n). Thus, 
s0(n) operates on C/ '(0, n), and each representation of the Clifford algebra C/(0, 1) 
therefore induces a representation of so(n). In particular, in this manner, we obtain 


the spinor representation of s0(n) (which induces a double valued representation of 
SO(n)). 


Remark The presentation here partly follows that of [22]. The original reference for 
Clifford modules is [6]. 


1.3.3 The Dirac Operator 


As explained, since the vector space V is a subspace of the Clifford algebra C/(Q), it 
operates on any representation of that Clifford algebra. We can thus multiply a vec- 
tor, an element of V, with a spinor, an element of S, that is, we have Clifford multi- 
plication 


VxS—oS. (1.3.20) 


In fact, since multiplication by an element of V interchanges St and S~, we have 
an operation 


V xs? SF, (1.3.21) 


Denoting the representation by y and letting ra be the partial derivative in the 
direction of e;, we can defin the Dirac operator 


D=yeor, (1.3.22) 
Ox 


which operates on spinor fields The square ) o — of the Dirac operator is then 
a linear combination of second derivatives; that linear combination depends on the 
quadratic form Q definin the Clifford algebra. If the quadratic form Q is repre- 
sented by the identity matrix, that is, if we consider the Clifford algebra Cl(n, 0), 
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the square of the Dirac operator is the (negative definite Laplace operator (see 
(1.1.103), (1.1.105)) 


a eee (1.3.23) 


In order to develop some structural insights, it is now useful to start with the com- 
plex case, or more precisely with a complex vector space V with a nondegenerate 
quadratic form Q. As Q is nondegenerate, it induces a nondegenerate bilinear form 
(.,.), w.r.t. which V is self-dual. Moreover, on a representation S of the Clifford al- 
gebra C/(Q), we can fin a nondegenerate bilinear form (., .) that is invariant under 
multiplication by v € V = C/'(Q): 


(vs, t) = (s, vt) (1.3.24) 


for all s, t € S. We can then use (., .) to identify S with its dual S*, and (1.3.20) then 
induces morphisms 


r:S*x S*oV (1.3.25) 
and 
r:SxS-V. (1.3.26) 


In (1.3.25), to any two elements of S*, we assign a vector v € V that operates on 
a pair o, t of elements of S by (vo, T), cf. (1.3.20), (1.3.24). 
Using bases {s,} and {e,,} of S* and V, we write (1.3.25) as 


T'(5q, 5p) = Te Gy (1.3.27) 


These morphisms are symmetric and equivariant w.r.t. the representation of C/(Q). 
Turning to the real case, the situation is not as convenient: we cannot always fin real 
versions of these morphisms; they only exist in certain cases. This depends on the 
classificatio of Clifford algebras. They always exist for the Minkowski signature, 
that is, for the Clifford algebra C7(1,n — 1), in any dimension n. They also exist for 
C1(2, 0), the case of particular interest for us. 


1.3.4 The Lorentz Case 


Let us also exhibit the relation between the orthogonal group and the spin group 
in the Lorentz case. There exist many references on this topic, including the clas- 
sic [81]. Let x = (x°, x!, x7, x3) ER! 3. We put 

goes? =22! 27x? = 3727, (1.3.28) 
The subgroup of G/(4, R) that preserves (x, x) is the Lorentz group O(1, 3). It con- 
sists of two components that are distinguished by the value of the determinant, +1 
or —1, and have otherwise the same properties. Thus, we consider the identity com- 
ponent SO(1, 3) where the determinant is +1, without essential loss of generality. 
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We shall see that the corresponding spin group is S/(2, C), the group of complex 
2 x 2-matrices with determinant 1. To x, we associate the Hermitian matrix 


0 3 1 +\2 
un [x +x X° 1X 
Pa 4 zl Cee oe) (1.3.29) 
where oo, ..., 03 are the Pauli matrices. We firs note that 
(x, x) = det X. (1.3.30) 
Since we have 
Tr(Oyov) = 2p, (1.3.31) 
we obtain 
1 
x’ = 5 Tr(Xo,,) (1.3.32) 


as the inverse of the equation expressing X in terms of x. 
In the physics literature, one writes the Hermitian matrix X as 


xii x2 
& eae (1.3.33) 


By the Hermitian condition, X oB — XB& go that X!! and X”2 are real. In this nota- 
tion, (1.3.32) becomes 


x0 = (KX +X"), xh 5 (KP +X"), 


xm (x! — x2), = 5x) — x4), 


We may use the relation (1.3.29) between a vector x and a Hermitian matrix X 
to defin an operation of S/(2,C) on R!3 as follows: 
For A € S/(2, C), we put 


T(A)X := X’:= AXA’. (1.3.34) 
With indices, this is written as 
X'e* = Ag At xP? (1.3.35) 


Here, the dotted indices refer to the transformation according to the conjugate com- 
plex of A, and this then explains the convention employed in (1.3.33). The fact that 
two As appear in (1.3.34) suggests that one consider this expression as a product: 
Instead of the 4-vector X, we take two spinors ¢, x that transform according to 


P= Argh, y'%= Ay’. (1.3.36) 
Their product then transforms like X in (1.3.35), 
gx" = AG AE PY”. (1.3.37) 
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Using the above formulae, we can express (1.3.34) as a transformation of the 
vector x: 


ed I ' er ; 
x/H = 5 Tt(Xory) = gee On) = 5% Tr(Ao,A!lo,). (1.3.38) 

Thus, 
x’ = Bx, (1.3.39) 

with 
Beal aya At _ly AoyAt 1.3.40 
= 5 r(Ao, on) = 5 t(o,Ao,A') (1.3.40) 


as the trace is invariant under cyclic permutations. 

One may check from this that (1.3.34) induces a Lorentz transformation, but this 
can more easily be derived from the fact that (1.3.34) maps Hermitian matrices to 
Hermitian matrices and preserves the determinant (since A € S/(2,C) has deter- 
minant 1), and (1.3.30) then implies that (-,-) = x99 — x!x! — x?x? — x3x3 (see 
(1.3.28)) is preserved. 

Also, this yields a homomorphism 


T : S12, C) > SOC, 3) 


with kernel {+1} (+ identity in S/(2, C) leads to the identity in SO(1, 3) in (1.3.34)), 
and image the identity component of the Lorentz group.” S/(2, C) is the universal 
cover of the identity component of the Lorentz group, which is doubly connected. 
Therefore, in the physics literature, representations of S/(2,C) are usually consid- 
ered as double-valued representations of SO(1, 3). 

We also observe that the homomorphism T in (1.3.34) maps SU(2) to SO(3). 
Namely we have, for A € SU(2), 


Tr(T(A)X) = Tr(AX A‘) = Tr(AXA7!) = Tr(X) = 2x° 


in the notations of (1.3.29). Thus, T(A) preserves x, and since (x,x) = xPx9 — 


x!x! — x?x? — x3x3 is also preserved, it preserves 
xty! + oR + x3x3 


2 x3 space. As before, 


and therefore yields an orthogonal transformation of the x!, x 
this yields a twofold covering of SO(3), and SU(2) = Spin(3). 

Since SO(1,3) acts by automorphisms on R!? which can be considered as 
a group of translations, we can form the semidirect product SO(1, 3) x R'3 where 
(B,b) € SO(1,3) « R!3 operates on R!? via x + Bx +b and where “semidi- 
rect product” refers to the obvious composition rule. The group of all isometries 
of Minkowski space is the semidirect product O(1, 3) « R!3, the Poincaré group, 
but it suffice for our purposes to consider its connected component containing the 
identity. Again, it is covered by S/(2,C) x R!3. 


° The Lorentz group has four connected components, all isomorphic to SO(1, 3), and we obtain the 
other components by space- and time-like reflection from the identity component. 
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The irreducible unitary representations of S/(2,C) x R!> were classifie by 
Wigner. We sketch here those aspects of the representation theory that are directly 
relevant for elementary particle physics. A mathematical treatment to which we re- 
fer for further details and which emphasizes the applications in physics is given 
in [98], whereas a comprehensive presentation from the perspective of physics can 
be found in [103]. Since R!? is a normal subgroup of S/(2,C) x R!, the study 
of the representations proceeds by describing the orbits of the action of S/(2, C) 
on R13, identifying the isotropy group of a point on each orbit, called the “little 
group” in physics, and then findin the representations of those isotropy groups. We 
know from (1.3.28) that 


me =e a OO” aa x! He? = (1.3.41) 


is preserved by the action of S/(2,C) on R!3. In particular, each orbit must be 
contained in a level set of m?. Physically, m is the mass of the particle define 
by the representation, and it then suffice to consider the case m > 0. Using the 
identificatio (1.3.29) of x € R'3 with the matrix 
04 43 1_ 7,2 
ua fe +x xX” 1X 
Xi=x ou= (5% eae 


for m? > 0, we can select the point 


care: 


depending on whether x° > 0 or < 0. Since this is a multiple of the identity matrix, 
its isotropy group, that is, the group of matrices leaving it invariant under conjuga- 
tion, see (1.3.34), is SU(2). As described for instance in [45, 75, 98], the irreducible 
unitary representations of SU(2) come in a discrete family, parametrized by a half 
integer 


ek ean 
2 2 
which can be identifie with the spin of the particle. Thus, the class of representa- 
tions corresponding to an orbit with m? > 0 is described by the continuous parame- 
ter m” and the discrete parameter L from (1.3.42). 


A point on an orbit with m? = 0 is 


(0 0): 


and its isotropy group is define by the invariance condition 


(0 o)’=(6 0) 


This implies that A has to be of the form 


(1.3.42) 
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for some z € C, 6 € R. Looking at the conjugation 


gy ONT a) fe". O\- 71 eer" 
0 e}\o 1/0 ee) \o 1 J? 


we see that the isotropy group, denoted by E(2), is a double cover (because of the 
angle 20) of the group of Euclidean motions SO(2) x C. By the same strategy as 
before, for determining its representations, we should look at the orbits of the SO(2) 
action on C which are the origin 0 and the concentric circles about 0. The repre- 
sentations corresponding to the latter do not occur in elementary particle physics. 
So, we are left with the origin whose isotropy group, the little group, is SO(2). Its 
irreducible representations are all one-dimensional and labeled by 


i 
s=0,£5,41,... (1.3.43) 


where the factor 5 corresponds to the fact that the rotations were about an angle 20. 
The key for understanding the representations of SU(2) is the following. The Lie 
algebra su(2) is generated by 


ty t= 5H EADS (1.3.44) 
with the Pauli matrices o,,, see (1.3.18). They satisfy 
[tu tyl= Envolo (1.3.45) 


with the totally antisymmetric tensor €,,,.. The real matrices 


: : 0 1 ; ; 0 0 
en itn in) = (5 ae ey i= itn +in) = (j i (1.3.46) 


: 1/1 0 
h:=-i = 5 G 2): (1.3.47) 
yield a basis of the Lie algebra sI(2, IR) and satisfy 
[A,tJ=t, [h,t-]=—-t_, [t,t] = 2h. (1.3.48) 


From this, one deduces that when p is a representation of sl(2,C) on a vector space 
V and vy is an eigenvector of o(h) with eigenvalue A, then p(ti)v, are eigenvec- 
tors of p(h) with eigenvalues 2 + 1. One then find that the possible values of A 
are L,L —1,...,—L for some half integer L = 0, 5s 1,..., see e.g. [45, 75, 98]. 
Since the eigenvalues are nondegenerate, the dimension of this representation is 
then 2L + 1. 


1.3.5 Left- and Right-handed Spinors 


We now put the transformation rule (1.3.37) for the product of two spinors into 
a more general perspective that will be needed below in Sect. 2.2.1 for definin La- 
grangians for spinors. According to our previous general discussion, in the present 
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case of IR* = C?, the spinor space is a four-dimensional complex vector space, i.e., 
isomorphic to C+. We have already seen in Sect. 1.3.2 that the spinor representation 
is not irreducible as a representation of the spin group, but splits into the direct sum 
of two chiral representations, i.e., each spinor can be written as 


_ (WL 
= (2). (1.3.49) 


Wt is called a left-handed, wp a right-handed spinor. 
A € S/(2, C) then acts via 


Wie AVL, 
S239 (1.3.50) 
were (AT) lye. 
With A = (Al) j.ea1.25 we have 
Wie AT, 
wire Aiwk with ALA! = 6;;. 
From (1.3.40), we also get with the help of (1.3.31) 
Beo, = Alo, A (1.3.51) 


(note that here the summation convention is used even though the position of the 
indices is not right—it would be better to write the os with upper indices, but we 
refrain here from changing an established convention). Putting 


say=(4 alr v= (31). (1.3.52) 


the action of A is described by 
wt S(A)y. (1.3.53) 


In the Weyl representation, with 


we then have 
stray sty? (1.3.54) 
Finally (1.3.51) implies 
Sly#S = By’, (1.3.55) 
For two left-handed spinors (see (1.3.49)) ¢, x, 
bX := apo? x? (1.3.56) 


transforms as a scalar under the spinor representation; namely 
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cup ALb” AS x° = Ay AZ@Y x° — AL ASO” x? 
= (Aj A3— ATA})G'x? + (AAT — AZAD)O"x' 
= det A(p! x? — ¢°x!) 
=eyp*x? (since det A= 1 for A € SI(2,C)). 


Similarly 
OP Oiacin” (1.3.57) 
transforms as a vector, for 4 = 0, 1, 2,3. This can be better understood by consid- 
ering full spinors 
oa 
= : 1.3.58 
W (vi ( ) 


Following the physics notation, in the Lorentzian case, we defin y* as the complex 
conjugate of yw, and the Dirac-conjugated spinor as 


pi=w'y®. (1.3.59) 
In the Riemannian case, the y° is omitted, that is, 
via. (1.3.60) 


Thus, returning to the Lorentzian case, yw = Wlwr + Wry. Then in the Weyl 
representation, 


vy" (1.3.61) 
transforms as a vector. In fact, applying a transformation A € S/(2, C), we get 
WIS(A) Py" S(A)o = Wy S(A)'y?y#S(A)o 
=wWS!y"Sw by (1.3.54) 
= Biiy’a by (1.3.55), 


which is the required formula. Since py“o = Wly°y#w = Vi o,or - vhouor, 
we also see why (1.3.57) transforms as a vector. 


1.4 Riemann Surfaces and Moduli Spaces 


1.4.1 The General Idea of Moduli Spaces 


We start with some general principles; their meaning may become apparent only 
after reading the rest of this section, and the reader is advised to proceed when these 
principles are unclear and return to them later. It may be helpful, however, to try to 
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understand the sequel in the light of these principles. In any case, the present section 
is more abstract and has more of a survey character than the preceding ones. 

One is given a mathematical object with some varying structure. An example of 
such an object is a differentiable manifold S, and the structure could be a complex 
structure, a Riemannian metric—perhaps of a particular type—and so on. One wants 
to divide out all invariances; for example, one wants to identify all isometric metrics. 
The invariances usually constitute a (discrete or Lie) group. The resulting space 
of invariance classes is then a moduli space. This already suggests that there will 
be a problem (more precisely, singularities of the moduli space) caused by those 
particular instances of the structure that possess more invariances than the typical 
ones, for example, those Riemannian metrics that are highly symmetric. The reason 
is obvious, namely that for those instances, we need to divide out a larger group of 
invariances than for the other ones. 


Heuristic guiding principle 
The moduli space for structures of some given type carries a structure of the same 
type. 

So, for example, we expect a moduli space of Riemannian metrics to carry a Rie- 
mannian metric itself, a moduli space of complex structures to be a complex space 
itself, a moduli space of algebraic varieties to be an algebraic variety itself. 

Typically, the space of such structures is not compact, that is, these structures 
can degenerate. One then wishes to compactify the moduli space. The compact- 
ifying boundary then also contains (certain) degenerate versions of the structure. 
The choice of admissible degenerate structures—which need not be unique—can be 
subtle and should be carried out so that the resulting space is a Hausdorff space. 

Often, one also wishes to get a fin moduli space M fine. Let p be a point in 
the (ordinary, or coarse) moduli space M representing an instance g of a structure. 
M fine then should be the fibratio over M with the fibe over p being that g. 


1.4.2 Riemann Surfaces and Their Moduli Spaces 


A Riemann surface can be define in several different ways, that is, through differ- 
ent types of structures. While these notions turn out to be equivalent in the end, they 
lead to different approaches to the moduli space of Riemann surfaces and equip that 
moduli space with different structures, according to the above principle. We shall 
now explain these different structures and also illustrate why they are interesting, in 
particular how they lead to different mathematical constructions and applications. 
For more details and proofs, we refer to [64] and other references cited subsequently. 
A profound knowledge of Riemann surface theory is useful for understanding con- 
formal fiel theory and string theory mathematically. Let S be a compact differen- 
tiable orientable surface of genus p. If not explicitly stated otherwise, we assume 
p>. 

The basic point is that one and the same such differentiable surface can carry 
a continuum of different Riemann surface structures. That is, there are many pairs 
x1, X2 of Riemann surfaces that are both diffeomorphic to S, but not equivalent as 
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Riemann surfaces. The moduli problem then consists of definin and understanding 
the space of all such Riemann surfaces (modulo holomorphic equivalence). 


1. A Riemann surface & is a discrete (fixe point free, cocompact) faith- 
ful representation of the fundamental group 7; (5S) into G := PSL(2, R), deter- 
mined up to conjugation by an element of G. The moduli space is the space of 
such representations modulo conjugation. 

More precisely: A Riemann surface & is a quotient H/T’, where H = {z= 
x +iy €C: y > 0} is the Poincaré upper half plane and I is a discrete group of 
isometries with respect to the hyperbolic metric 


1 7 
yade di. (1.4.1) 


I is a subgroup of the isometry group PSL(2,R) of H.'° Here, PSL(2,R) = 
SL(2, R)/+1, acting on H via zt act with a,b,c,d satisfying ad — bc = 1 
describing an element of SL(2, R). l should operate properly discontinuously and 
freely. It thus should not contain elliptic elements. This excludes singularities of the 
quotient H/T arising from fi ed points of the action of I’. In order to exclude cusps, 
that is, in order to ensure that H/T is compact, parabolic elements (see insertion 
below) of I’ also have to be excluded. Thus, all elements of I’ different from the 


identity should be hyperbolic. 


Insertion: Here, a transformation z > “ath of H is called hyperbolic if it has 


two fi ed points on the extended real axis R = 0H U {oo}, parabolic if it has one 
fi ed point on R, and elliptic if it has a fi ed point in H. Since the fi ed points 
are computed to be %* d 4 av (a +d)? — 4, the transformation is hyperbolic iff 
|a + d| > 2. The standard example of a hyperbolic transformation is z+» 2z, with 
fi ed points at 0 and ov, and a parabolic one is given by z b> a which has its 


unique fi ed point at 0. A hyperbolic transformation y maps the laypeibolic geo- 
desic / between its two fi ed points p1, p2 (the semicircle through p; and p2 or- 
thogonal to the real axis) into itself, that is, it is a translation along the hyperbolic 
geodesic 7. We can then easily visualize the operation of y on H; it simply maps 
each geodesic orthogonal to / to another such geodesic orthogonal to /, with the 
shift already determined by the operation of y on /. When we consider the example 
z+ 2z, the invariant geodesic is the imaginary axis. The invariant geodesic in H 
becomes a closed geodesic on the surface H/T, with length given by the length 
of the shift. A parabolic transformation does not have a fi ed geodesic, but instead 
rotates any geodesic through its fi ed point into another such geodesic. Therefore, 
a parabolic transformation does not produce a closed geodesic in the quotient. 

I is isomorphic to the fundamental group 71(S). Thus, a Riemann surface is 
described by a faithful representation p of 2 1(S) in G := PSL(2, R). This essen- 
tially leads to the approach of Ahlfors and Bers to Teichmiiller theory. Here, we 
need to identify any two representations that only differ by a conjugation with an 


10The isometries of H are the same as the conformal automorphisms of H, because of the confor- 
mal invariance of the metric. 
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element of G. Thus, we consider the space of faithful representations up to conju- 
gacy. A representation can be define by the images of the generators, that is, by 
2p elements of G, and this induces a natural topology on the moduli space. In par- 
ticular, this allows us to compute the dimension of the moduli space: Each of the 
2p generator images is described by three real degrees of freedom (a, b, c,d sat- 
isfying the relation ad — bc = 1) which altogether yields 6p degrees of freedom. 
From this, we firs need to subtract 3, the degrees of freedom for one generator, 
because the generators a1, b,...,@p, Dp of | (X) are not independent, but satisfy 
the relation aybya;'by' “++ Apbpaz by! = |. We also need to subtract another 3 
to account for the freedom of conjugating by an element g of PSL(2, R). Thus, the 
(real) dimension of the moduli space of representations of 2; (2) in PSL(2, R) mod- 
ulo conjugations is 6p — 6. This moduli space of representations of the fundamental 
group yields the Teichmiiller space T,. The moduli space M, is a branched quotient 
of that space. 

Singularities of the moduli space arise when the image I’ of ¢ has more automor- 
phisms than such a generic subgroup of G (whose only automorphisms are given 
by conjugations). Degenerations arise from limits of sequences of faithful, that is, 
injective representations p, that are no longer injective. Just as the Riemann sur- 
faces are obtained as quotients H/T’, the moduli space M, itself is likewise a quo- 
tient T,,/C of the Teichmiiller space T, by a discrete group, the so-called mapping 
class group. (This Teichmiiller space T,, is a complex space diffeomorphic—but not 
biholomorphic—to C3?~>, The complex structure was described by Bers through a 
holomorphic embedding into some complex Banach space. For recent results about 
this complex structure, we refer to [14]. T,, parametrizes marked Riemann surfaces, 
that is, Riemann surfaces together with a choice of generators of the firs homology 
group. Since all automorphisms of a hyperbolic Riemann surface act nontrivially on 
the firs homology, Teichmiller space does not suffer from the problem of the mod- 
uli space, that Riemann surfaces with nontrivial automorphism groups can create 
singularities.) 

This approach is also useful because it can be generalized to moduli spaces of 
representations of the fundamental group of a Kahler manifold in some linear al- 
gebraic group G. This is called non-Abelian Hodge theory and leads to profound 
insights into the structure of Kahler manifolds. In particular, because such repre- 
sentations can be shown to factor through holomorphic maps, this leads to the at 
present strongest approach to a general structure theory of Kahler manifolds via the 
Shafarevitch conjecture, see, e.g., [70-72]. 


2. A Riemann surface & is a 1-dimensional complex manifold. The moduli 
space is the semi-universal deformation space for such complex structures. 

More precisely: A Riemann surface & is S equipped with an (almost) complex 
structure. The relationship with 1 depends on the Poincaré uniformization theorem, 
which states that each compact Riemann surface of genus p > | can be represented 
as a quotient of H as in 1. Conversely, each quotient H/T as in | obviously inherits 
a complex structure from H, since operates by complex automorphisms on H. 

The moduli space M, is then constructed as a universal space for variations of 
complex structures. This means that if N is a complex space fiberin over some 
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base B with the generic (=regular) fibe being a Riemann surface of genus p, we 
then obtain a holomorphic map h : By > M, where Bo C B are the points with 
regular fibers In this manner, M,, as a moduli space of complex structures, acquires 
a complex structure itself that is determined by the requirement that all these h 
be holomorphic. Ideally, we would also like to have a holomorphic map / fine : 
No > Mp, fines No being the space of regular fiber in N, mapping the fibe over 
q € Bo to the fibe over h(qg) in Mp, fine, but this is not always possible due to the 
difficultie with Riemann surfaces with nontrivial automorphisms. More precisely, 
Mp, fine does not exist as such. A slight modification however, leads to such a fin 

moduli space; namely, we only need to equip our Riemann surfaces additionally 
with some choice of a root of the canonical bundle in order to prevent nontrivial 
automorphisms. This is called a level structure. This gives a finit ramifie cover 
of M>. That cover is free of singularities and then yields a fin moduli space. (The 
Teichmiiller space briefl described above is also a singularity-free cover of the 
moduli space, but, in contrast to the fin moduli space just introduced, it is an infinit 

cover and therefore not amenable to the constructions and techniques of algebraic 
geometry.) It is more subtle to understand what happens at the singular fibers Here, 
we need a suitable compactificatio M, of M, through certain singular Riemann 
surfaces. This, however, is better understood through the subsequent approaches to 
M, described below. 

This construction is useful because, for example, it allows a geometric proof of 
the theorems of Arakelov-Parshin and Manin on the finitenes of the number of such 
fibration of genus p over a given compact base B and the finitenes of the number 
of holomorphic sections of any given such fibration see [69]. The idea is to show 
that because of the geometric properties of M, and Mp, there can only be finitel 
many such holomorphic maps h : B > M p or (after taking care of the above need 
to take finit covers) from N into a compactifie fin moduli space. 


3. A Riemann surface is an algebraic curve, described by homogeneous poly- 
nomial equations. The moduli space is the space of coefficient of these polyno- 
mials modulo projective automorphisms. 

More precisely, a Riemann surface can be locally described as the common zero 
set of two homogeneous polynomials in three variables. The relationship with 2 
depends on the Riemann—Roch theorem, which yields the existence of meromorphic 
functions. 


Insertion: We briefl describe the relevant concepts. A line bundle L on ¥ is 
given by an open covering {Uj}j=1,.., of & and transition functions g;; € O*(U;N 
U;) (O* denoting the nonvanishing holomorphic functions) satisfying 


ij 8jii=1 on U, NY; foralli, j, (1.4.2) 
Sip Sjk° &ki = 1 on U;N U; OU, for alli, j, k. (1.4.3) 
Two line bundles L, L’ with transition functions g;; and g/ j> Tesp., are called iso- 


morphic if there exist functions ¢; € O*(U;) for each i with 


8ij = © gy on each Uj NUj. 
; J 
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By multiplying transition functions we can defin products of line bundles. The 
Abelian group of line bundles on © is called the Picard group of &, Pic(2). The 
Picard group Pic(X) is isomorphic to the group of divisors Div(X) modulo linear 
equivalence. (Divisors are finit formal sums )° Ny Pa with ng € Z, py € X. The ad- 
dition in Z induces a group structure on these divisors. Divisors are linearly equiva- 
lent when their difference is the divisor define by a meromorphic function. This is 
verifie when one expresses a divisor D in terms of its local definin functions: 

Fi. ee 

{(w fd: = € OU ui}. 

Si 
The function jf; is meromorphic on U;. When py € U;, we require that f; has a zero 
(pole) of order ng at Py ifng > 0(< 0). At all other points, f; has to be holomorphic 
and nonzero. 

We put 


gijc= fi on U; NU; 
fi 
to defin a line bundle, denoted by [D]. 

Let L be a line bundle with transition functions g;;. A holomorphic section h of 
L is given by a collection {h; € O(U;)} of holomorphic functions on U; satisfying 


hj = gijhj on U; al U;. 


The zeros of a holomorphic section of a line bundle L defin an effective (i.e., all 
nj > 0) divisor E, and when L = [D], that divisor is linearly equivalent to D, that 
is, E — D is the divisor of a meromorphic function. In general, the zeros and poles 
of a meromorphic section of L defin a divisor D with [D] = L. The degree of 
a divisor is the sum of its coefficients and from this one then define also the degree 
of the line bundle [D]. Thus, the degree of a line bundle counts the zeros minus the 
poles of a meromorphic section. 
The Riemann—Roch theorem for line bundles is then 


Theorem 1.3 Let L be a line bundle on the compact Riemann surface & of genus p. 
Then the dimension of the space of holomorphic sections of L satisfie the relation 


h®(L) =degL — p+1+h°(K @L~') (1.4.4) 


where K is the canonical bundle of &, that is, the line bundle of holomorphic 
1-forms. 


The equivalent formulation in terms of divisors replaces h°(L) by h°(D), the 
dimension of the space of effective divisors linearly equivalent to D. 

Thus, the Riemann—Roch theorem can be viewed as an existence theorem for 
meromorphic functions, or, equivalently, for holomorphic sections of line bun- 
dles, whenever the right-hand side of (1.4.4) is positive. For example, deg K = 
2p—2and h°(K) = p, deg K” =2 deg K =4p—4 and h°(K’) = 3p—3 for p > 1; 
K? is the line bundle whose sections are holomorphic quadratic differentials, that 
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is, locally of the form y(z)dz* with a holomorphic g. Collections of holomorphic 
sections of a line bundle L defin mappings into projective spaces because a change 
of the local representation of L multiplies them all by the same factor. One then 
needs sufficient! many independent sections to make such a map injective and thus 
to defin an embedding of the Riemann surface into a projective space. In fact, one 
can show that every compact Riemann surface can be holomorphically embedded 
into CP?. Moreover, since by Chow’s theorem every complex subvariety of CP” is 
algebraic, our Riemann surface can then be represented by polynomial equations. 

The relationship with | again goes via 2, that is, via the uniformization theorem. 
That theorem, however, is of a transcendental nature and thus outside the realm of 
algebraic geometry. 

So, a Riemann surface becomes a (projective) algebraic variety in CP, the zero 
set of algebraic equations. Such equations of a given degree can then be character- 
ized by their coefficients As automorphisms of CP? lead to equivalent algebraic 
curves, one needs to divide these out. A difficult emerges because the automor- 
phism group of CP? is not compact. Building upon the ideas of Hilbert, Mum- 
ford [83, 84] then developed geometric invariant theory to obtain the moduli space 
of algebraic curves. One then obtains the compactifie Mumford—Deligne moduli 
space M, as the moduli space of so-called stable curves, see [25]. As a moduli 
space of algebraic varieties, it is an algebraic variety itself, in agreement with the 
general principle. 


4. A Riemann surface is a collection of branch points on the Riemann 
sphere S? with branching orders satisfying the Riemann—Hurwitz formula. 
The moduli space is obtained from those collections by factoring out automor- 
phisms of S”. 

More precisely: Via some meromorphic function (whose existence again comes 
from the Riemann—Roch theorem), a Riemann surface is a branched cover of $7, the 
Riemann sphere, which can also be identifie with CP!. Again, we need to divide 
out automorphisms, this time those of S?; they have the effect of moving the branch 
points around. This approach already led Riemann to count the number of moduli 
for Riemann surfaces of a given genus, that is, the dimension of the moduli space. 
This is explained in [51], for example. 


5. A Riemann surface is a finit algebraic extension of the fiel of rational 
functions C(x) in one variable over C. 

From the algebraic representation in 3, one deduces that the fiel k(2) of mero- 
morphic functions on & is a finit algebraic extension of the fiel of rational func- 
tions C(x) in one variable over C. More precisely, 


k(x) ~ C(x)Ly]/P (x, y) (1.4.5) 


for some irreducible polynomial P. For example, an elliptic curve, that is, a Rie- 
mann surface of genus 1, can be described by a cubic polynomial 


y? — x(x — 1) —A) (1.4.6) 


for some 4 € C — {0, 1}. For z € &, we let R, be those meromorphic functions 
that are holomorphic at z. R, is then a subring of k(X) and has a unique maximal 
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ideal given by those functions that vanish at z. This means that, conversely, we 
can start with the fiel k(x) and defin the points of © as the maximal ideals of 
local subrings of k(=), and we may defin a Riemann surface as a fiel of the form 
C(x)Ly]/P(, y) for some irreducible polynomial P. This encodes the functorial 
aspects: Let ©;, Xz be compact Riemann surfaces, and let 


p : k(X2) > k(X)1) (1.4.7) 


be a homomorphism whose restriction to C is the identity. Then there exists a unique 
holomorphic map 


h:X\—> Xp (1.4.8) 
with 
$( f(z) = fACG)) (1.4.9) 


for all z € D, andall f €k(Xp). 

This algebraic definitio of a Riemann surface, which goes back to Dedekind 
and Weber, has the advantage that C can be replaced by any other algebraically 
closed fiel as the ground field We may take finit field Z,, and we can consider 
an algebraic equation P(x, y) giving our Riemann surface as above, modulo p. 
Doing this for all prime numbers p simultaneously yields important insights into the 
algebraic properties of such equations, see [36], and this was at the heart of Faltings’ 
proof of the Mordell conjecture [35]. We can also take, instead of C, a fiel of 
meromorphic functions on some variety B, in order to obtain an algebraic curve over 
a function field In more elementary terminology, we now consider a polynomial 
P(x, y) whose coefficient depend on the variable w € B. We thus obtain a family of 
Riemann surfaces as in 2, but now from an algebraic point of view. The unificatio of 
those two possibilities of considering varying ground field (depending on a prime 
number p or on a variable w in some algebraic variety) leads to arithmetic algebraic 
geometry. 


6. A Riemann surface & is (define by) an Abelian variety with a principal 
polarization, its Jacobian, that can be identifie as the group of divisors of de- 
gree 0 on & modulo linear equivalence or, equivalently, as the subgroup of the 
Picard group of line bundles of degree 0. Since not every principally polarized 
Abelian variety arises in this manner as the Jacobian of some Riemann surface, 
however, the moduli space of the latter is only a subvariety of the moduli space 
of principally polarized Abelian varieties. By considering periods of holomorphic 
1-forms, we can associate to a Riemann surface a principally polarized Abelian va- 
riety, its Jacobian. By Torelli’s theorem, each Riemann surface is determined by its 
Jacobian. This means that we can identify a Riemann surface with this Abelian va- 
riety, and the space of Riemann surfaces becomes a subspace of the moduli space of 
principally polarized Abelian varieties. What is not so nice about this is that the so- 
lution of the Schottky problem, that is, the question of characterizing those Abelian 
varieties that are Jacobians of Riemann surfaces, is rather complicated [97]. 


Insertion: We explain the above concepts in some more detail. H°(X, Q!°) 
is the space of holomorphic 1-forms on our compact Riemann surface &, that is, 
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the holomorphic sections of the canonical bundle K. Thus, h°(Q!) := h°(K) = 
dime H°(d, Q!:°) = p by Riemann-Roch. 

Let a,...,@p) be a basis of H°(x,Q!°), and a\,b1,...,@p,bp a canonical 
homology basis for &. Then the period matrix of X is define as 


Ja %p °° Sy, % 


The column vectors of zr, 


r= ( ain... f ap) and Papi (f eitesel Ea), i=1,...,p, 
Gj Qj bj bj 


are called the periods of X&. P},..., Pp are linearly independent over IR and thus 
generate a lattice 


A:= {n) Pi +--+ +12p Pop, nj €Z} 


inC?. 
Definitio 1.8 The Jacobian variety J(X) of & is the torus C?/A. 


For each zo in &, we have the Abel map (a holomorphic embedding) 
ji: ua JC) 


Zz Zz 
J(@):= (/ aise f up) mod A, 
Z0 Z0 


Here j(z) is independent of the choice of the path from zg to z, since a different 
choice changes the vector of integrals only by an element of A. 

By the theorems of Abel and Jacobi, we obtain an isomorphism ¢ from the group 
Pic®(X) of line bundles of degree 0, that is, from the group Div?(X) of divisors of 
degree 0 modulo linear equivalence, into the Jacobian J(X) by writing a divisor D 


of degree 0 as 
D= SG — Wy), 
Vv 


where z,, w, € & are not necessarily distinct, and putting 
Zv Zp 
Qg(D) := (x/ ay up) mod A. 
v Wy v Wy 


7. A Riemann surface is a conformal structure on S, that is, a possibility 
to measure angles. Equivalently, it is an isometry class of Riemannian metrics 
modulo conformal factors. The moduli space is obtained by dividing the space 
of all Riemannian metrics on S by isometries and conformal changes. More 


with 
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precisely: As already discovered by Gauss, a two-dimensional Riemannian mani- 
fold define a conformal structure, that is, a Riemann surface. Different Riemannian 
metrics can lead to the same conformal structure, and so we need to divide out 
such equivalences. This is the approach of Tromba and Fischer, see [101]. Thus, we 
consider the space R, of all Riemannian metrics on S. As a space of Riemannian 
metrics, it carries itself a Riemannian metric. If g is a Riemannian metric on S and 
h:S— S isa diffeomorphism, h*g is isometric to g via h. Thus, we need to divide 
out the action of the diffeomorphism group D, of S. It acts isometrically on Rp 
equipped with its Riemannian metric. Moreover, when we multiply a given metric 
g by some positive function 4, the metric Ag leads to the same conformal structure 
as g. Such multiplication by a positive function, however, does not induce an isom- 
etry of Rp (and this is at the heart of the anomalies in string theory that ultimately 
force a particular dimension (26 in bosonic string theory)). 


Insertion: Some details: Let g € R», be some Riemannian metric on S. Suppress- 
ing the issues of the precise regularity class of the objects encountered, the tangent 
space T, Rp is given by symmetric 2 x 2 tensors h = (hj;). Each such h can be 
decomposed into its trace and trace-free parts: 


h=pgth’ p: SR, (1.4.10) 
1 
hi = hij — 5 8ii8" Ii. (1.4.11) 


The decomposition (1.4.10) is orthogonal w.r.t. the natural Riemannian structure on 
Te Rp: 
gitp: 


(hij), (Lip) gn 2= [oe + «gl gk" \Nj Lim det g dz! dz (1.4.12) 


with « > 0 and 


ijkm = aes 


g git gin 4 gimgik — gil g 
2 

Since the value of « will make no difference for us, we put k = 5 so that (1.4.12) 
becomes 


(hij), ig i= / gs gk ipl jm det g dz! dz’. (1.4.13) 
Ss 


As it stands, this is only a weak Riemannian metric on the infinite-dimensiona space 
Rp, as (1.4.13) yields only an L?-product, but Clarke [20] showed that it becomes 
a metric space with respect to the distance function induced by the Riemannian 
product of (1.4.13). (The completion of this metric space is identifie in [19].) 

From (1.4.13), we see that the Riemannian metric (.,.)¢ on T,Rp is invariant 
under the action of the diffeomorphism group, but not under conformal transforma- 
tions. 

In order to get rid of the ambiguity of the conformal factor, we need to fin a suit- 
able slice in R, transversal to the conformal changes. By Poincaré’s theorem, any 
Riemannian metric on our surface S of genus p > | is conformally equivalent to 
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a unique hyperbolic metric, that is, S becomes a quotient H/T as above. This met- 
ric has constant curvature —1. With some differential geometry, one verifie that —1 
is a regular value of the curvature functional, and so, by the implicit function theo- 
rem, the hyperbolic metrics yield a regular slice. Thus, we obtain the moduli space 
M, as the space Rp; of metrics of curvature —1 divided by the action of Dp. In 
this way, the geometric structures on Rp induce corresponding geometric structures 
on M, as described in Tromba’s book [101]. R, is the space of symmetric, positive 
definit 2 x 2 tensors (g;;) on S. As already explained, a tangent vector to Rp, is then 
asymmetric 2 x 2 tensor (hj;), not necessarily positive definite It is orthogonal to 
the conformal multiplications when it is trace-free, and it is orthogonal to the action 
of D, when it is divergence-free. Such a trace- and divergence-free symmetric ten- 
sor then can be identifie with a holomorphic quadratic differential on the Riemann 
surface. 


Insertion: Some details: We recall the decomposition 
h=pgt+h', (1.4.14) 


where h’ is trace-free. As we have seen, this decomposition is orthogonal w.r.t. the 
natural Riemannian metric on T, Rp. In particular, since we only want to keep those 
directions that are orthogonal to conformal reparametrizations, we only need to con- 
sider the trace-free part h’. We next consider the infinitesima action of the diffeo- 
morphism group, with the aim of determining those h’ that are orthogonal to the 
action of that group as well. For that purpose, let (y;) C Dp, go = id, be a smooth 
family of diffeomorphisms, generated by the vector fiel 


d 
Vizji= nt Oizo: (1.4.15) 
The infinitesima change of the metric g under (¢;) is then given by 
d 
7 fi Bit=0- (1.4.16) 


(This is the Lie derivative Ly g of the metric in the direction of the vector fiel V.) 
With V denoting the covariant derivative for the metric g, 


d 
ay Gt OUe=Oij = gix(V_a VE + Bi(V a) 


0z 
= ij. eV" + BikVE + BRS. (1.4.17) 


In the above decomposition of R,, the directions corresponding to conformal 
changes are given by the tensors pg, whereas those representing D, are of the form 
(1.4.17). It remains to identify the Teichmiiller directions, i.e., those that are orthog- 
onal to the preceding two types. 

Our computations simplify considerably if we use conformal coordinates so that 
the metric (g;;) is of the form 


gij (Z) =A (2)6i;. (1.4.18) 


If a symmetric tensor h” is orthogonal to all multiples pg of g, it has to be trace- 
free. If it is orthogonal to all tensors that arise from the infinitesima action of the 
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diffeomorphism group, that is, of type (1.4.16), we get, using the symmetry of h” 


0= i} gl gh (gjemV™ + 28 jimV2)Vdetg dz! dz? 


1 a : 


= [2 iV dz'dz’, since h’ is traceless. 


If this holds for all vector field V, we conclude 


a " ° 
gaettix=0 fori =1,2. (1.4.19) 
This means that h’’, is divergence free. 
Thus, h” is symmetric, trace-free, and divergence free. These conditions can be 
interpreted in a more concise manner as follows: 


Being symmetric and trace-free, h” is of the form 


W " 
Ay Myo\ (: v ) 
" ” —F _ . 
Nyy Nyy ee 
Being divergence free, this tensor then has to satisfy 


uz =—vz2, u,2 =v 


z zl- 


Thus, u — iv is holomorphic, or, as a tensor, 
h” =u(dz')? — u(dz’)? + 2udz! dz 
= Re((u — iv)(dz! + idz’)) (1.4.20) 
is the real part of a holomorphic quadratic differential 
gdz =(u—iv)dz’. 


Thus, we have identifie the tangent directions of R, that correspond to nontrivial 
deformations of the complex structure as the (real parts of) holomorphic quadratic 
differentials on the Riemann surface define by (S, g). 

Thus, the cotangent!! space of M p at a point representing a Riemann surface 
xX is given by the holomorphic quadratic differentials on X. (This issue will be 
taken up again in Sect. 2.4 from a different point of view that also clarifie the re- 
lation between tangent and cotangent directions to the moduli space.) The complex 
dimension of this space is 3 p — 3 the Riemann—Roch theorem. M, then also inher- 
its a Riemannian structure from that of R,. The induced metric is the Petersson— 
Weil metric originally introduced in the context of approach 1. In a more abstract 


"Tt is not very transparent from our preceding considerations that we have constructed the cotan- 
gent and not the tangent space, but a careful accounting of the transformation behaviors can clarify 
this issue. 
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framework, the so-called L?-geometry of moduli spaces is investigated in [67]. Let 
®@ dz? = (wu, — iv}) dz? and W dz? = (uz — ivr) dz” be two such differentials. Let 
p*(z) dz dz be the hyperbolic metric. Then their Petersson—Weil product is 


1 3 
(® dz, Wdz*)¢ =2 | (uur + v1 v2) - ——dzdZ=2Re | ®W—— dzdiz. 
&§ 2 2 
p-(z) p-(z) 
(1.4.21) 


We have now listed seven rather different approaches for definin what a Rie- 
mann surface is. It is a very remarkable and profound fact that all these approaches 
give fully compatible structures on the moduli space M,. In each of them, one 
can construct a complex structure on M,, and they all agree, and together with 
the Petersson—Weil metric, one then find a Kahler structure on My. 

Nevertheless, some remarks are in order here: 


e From an algebraic point of view, the hyperbolic metric is a transcendental ob- 
ject and should be replaced by an algebraic one. There are also certain other 
natural metrics on a Riemann surface, like the Bergmann metric obtained from 
an L*-orthonormal basis of holomorphic 1-forms, that is, the metric induced by 
embedding the Riemann surface into its Jacobian, or the Arakelov metric de- 
fine from an asymptotic expansion of the Green function, a rather natural object 
in string theory. One may replace the hyperbolic metric in (1.4.21) by another 
metric uniquely associated to each Riemann surface and still obtain a natural Rie- 
mannian metric on M,. First steps in the direction of a systematic investigation 
have been done in [54, 55]. For more recent results in this direction, see [58, 59]. 
Let us briefl describe some of these constructions here. The Bergmann metric is 
given by 


Pp 
ppdz Adz := > 6; AG; (1.4.22) 


i=l 


where the 6; are an L*-orthonormal basis of the space of holomorphic 1-forms 
on &, that is, 


5 [4 Ab; =i. (1.4.23) 
Equivalently, the metric is induced from the fla metric on the Jacobian J(X) 
via the period map j : X > J(X). This latter description also shows that it does 
not depend on the choice of orthonormal basis—which, of course, is also readily 
checked directly. Moreover, the expression for the Bergmann metric is indeed 
positive definite that is, it define a metric, or equivalently, the derivative of the 
period map j has maximal rank. This follows from the fact that there is no point 
on & where all holomorphic 1-forms vanish simultaneously; this can be deduced 
from the Riemann—Roch theorem. 
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The Arakelov metric (references are [4, 18]) y*dzdz is characterized by the 
property that its curvature is proportional to the Bergmann metric, 
a2 


2 
acas BY = phe: (1.4.24) 


for some constant c, that depends only on p and can, of course, be explicitly 
computed.!* Alternatively, it is given in terms of an asymptotic expansion of the 
Green function of the Bergmann metric, 


logy (z) = — lim 27G(z, w) — log|z — w)), (1.4.25) 


with G satisfying 

i 
ap G em w= 55w (2) + ppp, (1.4.26) 
where 6,, is the Dirac functional supported at w, plus the normalization! 


[oc w) 5 padz A dz = 0. (1.4.27) 
x 


The Green function is regular for z 4 w and becomes —oo at z= w. 
exp 27 G(z, w) vanishes to first-orde at z= w. The firs term in the expansion of 
exp 27 G(z, w) is the universal term |z — w|, while the next one, y(z), encodes 
the geometry of the Riemann surface ©. 

If Ag is the Laplace operator for the Bergmann metric, and if $0, @1,... is an 
L?-orthonormal basis of eigenfunctions with eigenvalues 0 = Ap < Aj <A2..., 
then the Green function is given by the expansion 


CO 1 ee 
GG, w) =) 6) (Gj). (1.4.28) 
J 


j=! 


In fact, one can perform this construction of the Green function and the associated 
metric on the basis of any conformal metric on & in place of the Bergmann one. 
Arakelov discovered, however, that the Bergmann metric is distinguished here by 
the following property: When we use the Green function of a metric g to defin 
a metric on the canonical bundle K by putting 


2nG(z, z0)\' 
lldz||(zo) := (im se) 


(1.4.29) 
zo |Z — Zo 


where the absolute value on the right-hand side is taken w.r.t. to local coordinates, 
that is, in C, then the curvature of this metric on K is a multiple of g if and only 


Tn the sequel, c p Will denote a generic such constant whose value can change between formulas. 


13 The characterization of the Arakelov metric in terms of its curvature likewise needs an additional 
normalization to fully determine it. 


1.4 Riemann Surfaces and Moduli Spaces 77 


if we started with the Bergmann metric. In other words, we have the formula 


2 


a) 
| 2dz A dZ=cpprdz dz 1.4.30 
ni acd: og lls ||"dz A dz = cpppdz A dz ( ) 


for any locally nonvanishing holomorphic section s of K, and this is no longer 
valid for other metrics g used to construct a Green function. 

More generally, for a line bundle L over & with transition functions g;;, a Her- 
mitian metric A? on L is a collection of positive, smooth, real-valued functions 
de on U; with 


AG =ApgijBiy on U; NU j. (1.4.31) 


The norm of a section h of L given by the local collection h; is then define via 


hi(zo)/? 
In(zo) I? -= say for z € Uj. (1.4.32) 
i «0 


The curvature or firs Chern form is given by 


b.a? 
c1(L, A?) = —-—— log ||h||?dz A dz (1.4.33) 
2mi 0z0Z 
for any meromorphic section h and local coordinates z, and this is independent 
of the choices of h and z. Arakelov called a Hermitian line bundle L admissible 
w.r.t. a metric p*dz A dz on & if 


c1(L, A”) = deg Lp” dz A dz. (1.4.34) 


Let zo € ¥, and let z be local coordinates mapping zo to 0. We can put a Her- 
mitian metric on the line bundle [zo] by definin the norm of the local section z 
in a neighborhood of zg as 


Iz|(z1) = exp G(Z1, Zo). (1.4.35) 


This metric is then admissible for the Bergmann metric. So, what is special about 
the Bergmann metric here is that if we start the construction of the Arakelov 
metric from the Green function of that metric then the curvature formula recovers 
that metric. This only holds for the Bergmann metric and not for any other one. 

e We noted in 6 that we can inject the moduli space M, of Riemann surfaces of 
genus p into the moduli space A, of principally polarized Abelian varieties of 
dimension p. The latter also carries a natural (locally Hermitian symmetric) met- 
ric. Since the map j : My — Ap, while being injective by Torelli’s theorem, is not 
of maximal rank everywhere, the pullback of that metric via j has some singular- 
ities. Also, its behavior is qualitatively different from that of the Weil—Petersson 
metric, as will become clear below when we investigate degenerations of Rie- 
mann surfaces and their associated Jacobians. 
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1.4.3 Compactifications of Moduli Spaces 


Some of the preceding approaches also naturally lead to compactification of Mp. 


1. We already mentioned the Mumford—Deligne compactificatio M, as an al- 
gebraic variety. It consists of so-called stable curves, that is, possibly singular 
curves, but with a finit automorphism group. The sphere with no, one, or two 
punctures and the torus are thereby excluded. This is necessary for the Hausdorff 
property. 

The difficult here can be seen from the following easy example: We con- 
sider annuli, and by the uniformization theorem, each annulus is characterized 
by a single modulus, a real number 0 <r < 1; that is, it is conformally equiva- 
lent to an annulus 


Ay = {z€ Cir <|z| <1}. (1.4.36) 


Thus, the moduli space of annuli is (0, 1). It seems obvious how to compactify it, 
namely by simply adding the boundary points r = 0 and r = 1. Now r = 1 does 
not correspond to a Riemann surface anymore, and so this is not a good limit. 
The annulus A;, however, is conformally equivalent to the annulus 
; 1 r 1 
A, t= ——Ay; = 47 € C: —— < [z| < ——f, (1.4.37) 
l-r l-r l-r 

which for r > 1 converges to an infinit strip, that is, the limit can be identifie 
with {x +iy €C:0 < y < 1}. The boundary point r = 0 seems harmless because 
it simply corresponds to the punctured disk 


D* ={zEC:0< |z| < 1}. (1.4.38) 
However, the annulus A; is also conformally equivalent to the annulus 
1 1 
A= az4,={zeC: ¥F <1 < zh, (1.4.39) 
and if we now let r tend to 0, the limit is the punctured plane 
C*={zeC:z40}, (1.4.40) 


which is not conformally equivalent to the punctured disk D*. Thus, from the 
same limit r — 0, we obtain two different limits, D* and C*, and therefore, we 
lose the Hausdorff property. Mumford’s insight was that this problem essentially 
arises from the fact that the putative limit C* has a noncompact automorphism 
group. In fact, its automorphism group contains all transformations of the form 
z— xz for any 4 € C*. Mumford’s theory then declared such limits as unsta- 
ble and disallowed them. The problem of the noncompact automorphism group, 
however, will re-emerge later when we consider conformally invariant variational 
problems. The essential point is the following: We consider any Riemann surface 
=x and choose local coordinates z in the open unit disk U = {z EC: |z| < 1} 
around some point po, such that po corresponds to 0. We then replace the co- 
ordinate z by z, := Az EAU = {z EC: |z| < A}. When we let 4 € R tend to 
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oo, we obtain Zo € C, but these are not coordinates for a local neighborhood of 
po anymore because any fi ed Zoo € C now corresponds to po itself. In a sense 
to be made precise, they thus parametrize an infinitesima neighborhood of po. 
We can compactify this infinitesima coordinate patch C by adding the point at 
oo to obtain the sphere S*. Thus, we have created a nontrivial Riemann surface, 
the sphere S*, by blowing up a neighborhood of our point po € ©. Again, if 
we allowed such processes in the construction of the moduli space, we would 
need to consider the union of © and S? as a limit of the constant sequence ©. 
(As this so-called “bubbling off’ can be repeated, we should then even allow 
for infinitel many blown-up spheres.) At this point, as mentioned, this can sim- 
ply be excluded by fiat but the situation changes when these blown-up spheres 
carry some additional data, for example some part of the Lagrangian action in 
a variational problem. 

2. We recall from 2 in Sect. 1.4.2 that if N is a complex space fiberin over some 
base B with the generic (=regular) fibe being a Riemann surface of genus p, 
then we obtain a holomorphic map h : By) > M, where Bo C B are the points 
with regular fibers The fiber over B; := B\Bo are then singular, and we hope 
to extend h across Bj, that is, obtain a holomorphic map h : B > M p: Cer- 
tain difficultie arise here from the possibility that not all such singular fiber in 
a holomorphic family need to be stable in the sense of Mumford. Thus, in partic- 
ular, we cannot expect that the image of some point in B is given by the complex 
structure of that singular fibe . Nevertheless, after lifting to finit covers so that 
the quotient singularities of M, disappear, one can extend h to a holomorphic 
map h : B + My. This depends on certain hyperbolicity properties coming from 
the negative curvature of the Weil-Petersson metric on M,, that lead to general 
extension properties for holomorphic maps, see [69]. 

3. While the preceding is a global aspect, one also has a convenient local model for 
degenerations of Riemann surfaces within 2. We consider two unit disks D; = 
{z€C: |z| < 1} and D2 = {w EC: |w| < 1}. Fort € C, |t| < 1, we remove the 
interior disks {|z| < |f|}, {]w| < |t|} and glue the rest by identifying z with w by 
the equation zw =f to obtain an annular region A;. For t > 0, A; degenerates 
into the union of the two disks D;, D2 joined at the point z = w = 0. This is the 
local model for degeneration. The connection with the consideration of families 
as advocated in the preceding item of course comes from considering the smooth 
two-dimensional variety N := {(z, w,t): zw —t=0, [z|, |wI, |¢| < 1} for which 
(z, w) yield global coordinates. N fiber over the base B := {ft : |t| < 1}, with 
a single singular fibe over Bo = {0}. 

This local model is easily implemented in the context of compact Rie- 
mann surfaces as follows. We let Xo be either a connected Riemann surface 
of genus p — | > 0 with two distinguished points x), x2, called punctures, or 
the disjoint union of two Riemann surfaces x! =? of genera p), p2 > 0 with 
Pi + p2 = p and one puncture x; € ! each. We choose disjoint neighborhoods 
U,, U2 of the punctures and local coordinates z: U; > D,, w: Uz > D2 with 
z(x1) = 0, w(x2) = 0. By performing the above grafting process on the coordi- 
nate disks D; and D2, we obtain a Riemann surface &, of genus p for t 4 0. The 
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correspondence 

tr dy, (1.4.41) 
induces a map of D* = {t € C: 0 < |t| < 1} onto a complex curve in the mod- 
uli space M, which extends to a map from D = {t € C: |t| < 1} = D* U {0} 


into the compactificatio M p- (Because of the genus restrictions imposed, these 
degenerations all yield stable curves.) 


. Approaches | and 7 suggested looking at the moduli space of hyperbolic metrics. 


A hyperbolic metric on a compact surface can degenerate into a noncompact but 
complete hyperbolic metric of finit area with cusps. In the local model described 
in the previous item, this looks as follows. On the annulus A; = {|t| < z < 1}, we 
have the hyperbolic metric 


log |z| 


dz Adz ( 7 Tog it| ) (1.4.42) 
|z|2 log? |z| sin(x Pee 


For |t| — 0, this converges to the hyperbolic metric on the punctured disk {z : 
0 < |z| < 1} given by 


dz dz 


ie log? [a] (1.4.43) 
This metric is complete at 0, that is, the cusp 0 is at infinit distance from the 
points in the punctured disk. Also, the area of every punctured subdisk {z:0 < 
|z| < p},0 < p < Lis finite 

For the hyperbolic metric (1.4.42) on the annulus A;, the middle curve 
|z| = </|t| is the shortest of all the concentric circles, hence a closed geodesic, 
denoted by c. The reflectio zt : is then an isometry leaving c fi ed. Its length 
I goes to 0 as t > 0, while its distance from the boundary |z| = 1 goes to oo. 
Thus, as ¢ goes to 0, the geodesic c degenerates into a point curve at infinit 
distance from the interior. Therefore, in geometric terms, the degeneration is 
described by pinching a closed geodesic on some annulus inside our Riemann 
surface equipped with the hyperbolic metric. In fact, a hyperbolic metric on an 
annulus that is symmetric about a closed geodesic is uniquely determined by the 
length of that geodesic. That means that the hyperbolic on the annulus A, is in- 
duced by the metric of the Riemann surface %,; as described above. Thus, even 
though we have presented it here as a local model, it captures the essential global 
aspects. 

This consideration of varying hyperbolic metrics leads to the same compact- 
ificatio M, of M, asa topological space, see [12]. The noncompact surfaces 
can be compactifie as Riemann surfaces by adding a point at each cusp. We thus 
see that elements in the compactifying boundary of M , correspond to surfaces of 
lower topological type with additional distinguished points, so-called punctures. 


Insertion: The degeneration can also be described in terms of the generators 
of the discrete group I’ considered in 1. Since hyperbolic elements are character- 
ized by |a+d| > 2 and parabolic ones by |a +d| = 2, the relevant degeneration is 
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one where we have a sequence I,, of surface groups with hyperbolic elements y,, 
converging to a parabolic element yo. An example is the sequence of hyperbolic 
transformations 


(+i)z+4 


: 1.4.44 
Yn +> oa ( ) 
converging to the parabolic transformation 
Yo: Zt (1.4.45) 


z+1 

In the limit, the two fi ed points of y, merge into the single fi ed point 0 of 
yo. Also, the length of the invariant geodesic for y, approaches 0 as n > oo. 
Thus, again, the degeneration is described by pinching a closed geodesic on our 
Riemann surface equipped with the hyperbolic metric induced from H. 

We now want to relate the geometric description of degeneration just es- 
tablished to the analytic model described previously. We firs describe how to 
get from the analytic model to the geometric one. The behavior of the hyper- 
bolic closed geodesic |z| = ./]f| for the hyperbolic metric (1.4.42) on the an- 
nulus A; translates into the following picture for hyperbolic isometries of H. 
We consider the hyperbolic isometry y, : z+ Az for some A > 1. This leaves 
the imaginary axis in H invariant, and so its image on the quotient H/T by 


the group I’ generated by y, is a closed geodesic of length /, 3 a = log. Via 


zt> loga exp (zt (log(—iz) + log2)), H/T is mapped onto C*, and the closed 

geodesic is mapped onto the circle |w| = loga. 

In order to see how the geometric model can be translated into the analytic 
one, one uses the collar lemma, which says that if & = H/T is a compact Rie- 
mann surface with a simple!* closed geodesic c of length /, then © contains an 
annular region, called a collar, about c isometric to A; with the hyperbolic metric 
A;, c corresponding to the middle curve |z| = ./|¢]. The boundary curves of the 
collar then are at a distance from c of at least aresinh (sRt775) which goes to co 
as | —> 0. Thus, we are in the local situation described by the analytic model. 

In fact, a theorem of Mumford says that pinching a simple closed geodesic is 
the only way a sequence of compact Riemann surfaces &, = H/T, of fi ed 
genus p can degenerate. Namely, if the lengths of (simple) closed geodesics 
on &,, are uniformly bounded below, then after selection of a subsequence, I’, 
converges to a subgroup Ip of PSL(2, R) for which Xo = H/T is a compact 
Riemann surface of the same genus p. 

5. Since we have equipped M, in approach 7 with a Riemannian metric, the 
Petersson—Weil metric, we can study its compactificatio as a metric space. 
Again, as follows from the computations and estimates of Masur [80], this leads 
to the same M, viewed as a topological space, see [107]. In particular, M, 
is not a complete metric space, that is, the boundary M,\M p is at finit dis- 
tance from the interior. Moreover, when we approach that boundary orthogonally 


'4That is, non-self-intersecting. 
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along some curve c, the tangent directions orthogonal to c converge to bound- 
ary tangent directions. For a survey of some recent refinement of these results, 
see [108]. For the relation with the completion of the space R, of Riemannian 
metrics, see [19]. 

As explained in 6 of Sect. 1.4.1, by Torelli’s theorem, the correspondence be- 
tween a Riemann surface and its Jacobian leads to an injective mapping from 
My, into the moduli space A, of principally polarized Abelian varieties of di- 
mension p. A,» is a quotient H,/A, of the Siegel upper half space by a discrete 
group (#7; is simply the Poincaré upper half plane, and H,/A > is then a higher- 
dimensional generalization of the modular curve H/SL(2, Z). Hp is the space of 
symmetric complex (p x p) matrices with positive definit imaginary part. The 
discrete group Ap is Sp(2p, Z), the group of real (2p x 2p) matrices M with 
integer entries that satisfy MJ M' = J for J = Cz d oO) It admits a compactifi 
cation firs studied by Satake. Baily [8] then studied the induced compactificatio 
Mp. This is different from M, and, in fact, highly singular. It can be obtained 
from M p by forgetting the positions of the punctures or cusps of the limiting Rie- 
mann surfaces in M p- This is useful for the study of minimal surfaces of varying 
topological type, see [60, 61, 68], because the punctures would correspond to 
removable singularities. We shall discuss this issue briefl below in our study of 
the Dirichlet integral, our fundamental action functional, see Sect. 2.4. 

Also, in string theory, one ultimately wishes to extend the partition function 
over all possible genera, and one therefore needs some kind of universal mod- 
uli space that includes surfaces of all possible genera. The problem with the 
Mumford—Deligne compactificatio is that as the genus increases one gets sur- 
faces with more and more punctures in the low boundary strata, in fact infinitel 
many in the limit of the genus going to infinit . This is avoided in the Satake— 
Baily compactificatio just described. 

There is another issue of interest here: We have described the degeneration of 
a family of Riemann surfaces by pinching a closed geodesic, that is, letting its 
length shrink to 0. These geodesics can be topologically of two different kinds. 
The firs possibility is that it corresponds to a nontrivial homology class. When 
we pinch such a geodesic to a point and compactify the resulting surface by in- 
serting two points, in the limit we still have a connected surface, but of lower 
genus, and therefore its space of holomorphic 1-forms has a smaller dimension. 
Therefore, the limiting surface also has a Jacobian of smaller dimension, and so 
we move into the boundary of A. The other possibility is that we pinch a curve 
that is homologically trivial, i.e., a commutator in the fundamental group 7 (X). 
If we pinch such a curve and again compactify by inserting two points, the re- 
sulting surface is disconnected, but the genus p is not lowered, that is, the sum 
Pi + p2 of the genera of the pieces Xj and Xz equals p. Therefore, also the di- 
mension of the Jacobian is not lowered, and although we move to the boundary 
of the moduli space M,, we stay inside the moduli space A». The Jacobian of 
our disconnected surface is simply the product of the Jacobians of the pieces D1 
and D2. Of course, in order to substantiate these contemplations, we need to clar- 
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ify in which sense the Jacobians of the family of degenerating surfaces converge 
to the Jacobian of the compactifie limiting surface. 


1.5 Supermanifolds 


1.5.1 The Functorial Approach 


We present here the abstract mathematical setting of supermanifolds. We consider 
a super vector space (over a ground fiel of characteristic 0, like R or C) 


W=We Wi 


that is Z/2Z graded. Elements w of Wo are called even, with parity p(w) = 0, 
those of W odd, with parity p(w) = 1. Morphisms between super vector spaces are 
required to preserve the grading. 

A super algebra A is a super vector space together with a product A@ A— A 
which is a morphism in the above sense. It is also required to be associative and to 
have a unit, in the ordinary sense. 

Now, the important point about super objects is that whenever an operation 
changes the order of two odd elements, a minus sign is introduced. In this sense, 
the super algebra A is (super)commutative if for any two a,b e€ A, 


ab = (—1)?@P® ba. (1.5.1) 


(Here and in the sequel, whenever the parity of an element enters a formula, that 
element is implicitly assumed to be of pure type, that is, either odd or even, but not 
a nontrivial sum of an odd and an even term. Generally, definition are extended to 
inhomogeneous elements by linearity.) 

The basic example of a commutative super algebra is a Grassmann algebra with 
generators v),..., uy satisfying 


vjvj=—vjy;_ for alli, j (1.5.2) 
and thus, in particular, 
ve =0 for alli. (1.5.3) 
Hence, every element of this Grassmann algebra can be expanded as 
N 
v=aot ayy; +++ +a12,.vv102-+- UN. (1.5.4) 


i=] 
Since the square of any generator vanishes by (1.5.4), the expansion terminates. 


Similarly to (1.5.1), the rules definin Lie algebras pick up signs in the super 
context: The bracket of a super Lie algebra has to satisfy 


[v, w] + (—1)?? Tw, v] =0 (55) 
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and the super Jacobi identity reads 


[v,[w,u]]+ (—1)POPM+ PO) Ty, [u, v]J+ (—1)PMOCPM+PO) Ty, [v, w]] =0. 
(1.5.6) 
We now consider a complex super vector space W. A real structure on W is given 
by a C-antilinear automorphism 


k:W->W 
with 
Kew = (-1)P? w. O37) 


This should be considered as complex conjugation. We point out, however, that on 
the odd part, we obtain a minus sign in (1.5.7). As an example, let us assume that 
over R, the odd part W has two generators 1 and 2; we may then put 


K(01) =v, K(02) = —V1. (1.5.8) 


A supersymmetric bilinear form! (-,-) on W is given by a symmetric form (-, -)o 
on Wo and an alternating form (-,-); on W; with 


(kv,Kw); =(v,w); fori =Oand 1. (1.5.9) 
This implies 
(v, Kv) = (kv, Kv); = (— 1) (kv, v); = (v, KV);, (1.5.10) 
that is 
(v,«v) isreal forall ve W. (1.5.11) 


We may thus call the form (-, -) positive if 
(v,kv)>0  forallu40. (1.5.12) 


We can then defin (v,«v) as a “norm” on a complex super vector space. The point 
is that (v, v) = 0 if v is odd. We therefore need « which is only meaningful if W 
is define over C so that each odd coordinate has two real components, as in our 
example. In that example, we could put 


(01, 02) =1. (1.5.13) 


Then 1, and 2 would both have “norm” 1. 

If we have a complex super algebra A, we could then require that «(ab) = 
x(a) x(b). If we wish to also include non-commutative algebras, like matrix al- 
gebras with their complex conjugation, it seems preferable to take as the basis ob- 
ject a star-operation, a C-antilinear isomorphism from A to the opposite algebra!° 
satisfying 


(ab)* = (—1)POPO p*g*, (1.5.14) 


'SForms always take their values in C. 


'6Tf the product in A of a and b is ab, the product in the opposite algebra is define as 
(—1)P OP ba, 
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A Hermitian form (-,-) on a complex super vector space is C-antilinear in the firs 
variable, C-linear in the second one and satisfie 


(vu, wy = (—1)?OPM ) (w, v). (1.5.15) 
We have, since (-, -) is assumed to be even, that 
(v,w)=0 if pw) 4p), (1.5.16) 
and also 
(v,v) €R_ for v even, (v,v) €iR_ for v odd. (1.5.17) 


Then a super Hilbert space H is a super vector space with a Hermitian form satisfy- 
ing 


(v,v) >0 for v even, (1.5.18) 
i-'(v,v) >0 for v odd (1.5.19) 
and for which the ordinary Hilbert space structure define by 
(vu, w)) = (v, w) for v, w even, 
((v, w)) =i7!(v, w) for v, w odd, 
((v, w)) =0 for v, w of different parities 


is complete. In the present treatise, we shall be concerned only with finite 
dimensional super Hilbert spaces,'? and the completeness is not an issue then be- 
cause finite-dimensiona Euclidean spaces are always complete. 


1.5.2. Supermanifolds 


As for ordinary manifolds, there are several approaches to the definitio of su- 
permanifolds, and it is instructive to understand the relations between them. The 


standard model is R”!" with even coordinates (x!,...,x’") and odd coordinates 
(o!,..., 9”). Its sheaf of functions is C[3!,...,9”], the sheaf of commutative 
super algebras freely generated by odd quantities 3!,..., 9” over the sheaf C® of 


smooth functions on R”. Since the square of any / vanishes, they generate a nilpo- 
tent ideal in this sheaf. 

The functions in C~[#!,...,%”] then admit expansions in the nilpotent vari- 
ables. To explain this, we firs consider x = (x!,...,x) € U (open in R”) 
and & = (é!,...,&’") where the é! are even nilpotent elements, i.e., of the form 
ye 500g ay 019 o19°+4 higher even-order terms, that is, the expansion starts with 
products of two 2's. For a function that depends only on the even variables, we then 
require 


1h ee ee ae LL, 


1 
tl HEP DEN EM (1.5.20) 
y m-* 


'7Perhaps, one should better speak of super Euclidean spaces in that case. 
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where F as a function of x = (x!,...,x”) is of class C®(U). Alternatively, we 
can view this as the rule for extending or pulling back a function of the ordinary 
coordinates x = (x!,...,x’") to one of the coordinates x + € = (x! +é!,...,x"+4+ 
&"), When the function is also allowed to depend on the odd variables, we have the 
expansion 


PO EE cat ee" ce ) 
1 
= 0 a «a recat en emyind 9 
Vie Ym: a 
a Y 
(1.5.21) 


where the functions F% are of class C®(U). In these expansions, we may also allow 
for functions F® taking their values in a supercommutative algebra with unit in 
place of R. Usually, these functions will then be even. We note that the expansions 
(1.5.20) and (1.5.21) contain a number of derivatives that depend on n. Since we 
want to reserve the fl xibility to keep n variable, we must work with C°- instead 
of C*-functions for some finit k. 

There also exists a notion of (formal) integration, the Berezin integral, that inverts 
differentiation. 

If F is only a function of one odd variable 3, we have 


F(d}) =a+bo (1.5.22) 


where a, b are constants, i.e., independent of %. The integral of F w.r.t. # is then 
define by linearity and the basic rules 


[ao=o. foao=t. (1.5.23) 


This makes the integral translation invariant, 1.e. for an odd ¢, 
; FO +e)dv= Jo + bd + be)dd = bf vd} = i F(o)dv. (1.5.24) 


Similarly, for a function F of n odd variables oloo.., ov", 


1 ny iw with a = 0 or@ = (a),..., ax) 
ce ea a ( ee (1.5.25) 


the integral is computed via the rules 
[ao =0, [ s'av = bij. (1.5.26) 
Thus, we have 
[ra0%d0% --dd% =by fora =(ay,..., a). (1.5.27) 


A supermanifold of dimension m|n can be define by an atlas whose lo- 
cal charts are open domains of R”", that is, subsets with sheaf of functions 
C™(Up)f[v!,...,"], where Up is an open subset of R”. In terms of functions, 
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we are restricting the sheaf C®[#!, ..., %”] to Up. This will of course be the gen- 
eral procedure for definin sub-supermanifolds. Note that we are restricting the even 
coordinates x!,...,x’” here, but not the odd ones. So, we have coordinate charts; 
coordinate transformations are then given by isomorphisms f : U > V, U, V open 
in R’”!”", Such an isomorphism is given by even functions f!,..., f’” and odd func- 
tions ¢!, ...,@". To be an isomorphism, f must be invertible, and the functions 
must be smooth, as always. And a morphism is invertible iff the underlying mor- 
phism define by the f!,..., f” is invertible; the odd functions do not play a role 
for invertibility. 

Based on this, if F is a function on the chart U, and if f!,... f”,¢!,...," 
are coordinate functions on our supermanifold, we can compute the values 
F(f!,..., f',o!,...,¢"). We can therefore equivalently defin a supermanifold 
as a topological space Mo with a sheaf Oy of super (R)-algebras that is locally 
isomorphic to R”!", Functions on M are then sections of the structure sheaf Oy. 
Morphisms between supermanifolds f : M — WN are then morphisms of ringed 
spaces, that is continuous maps fo : Mo > No with a morphism of sheaves of su- 
per algebras from fj Oy to Oy. The odd functions generate a nilpotent ideal J of 
Om, because the square of any odd coordinate is 0. The space Mo with the sheaf 
Om/J is then a smooth manifold of dimension m, called the reduced manifold M,. 
A function f on M projects to a function f- on M,, that is, a smooth function on 
Mo. The sheaf morphism determines the function. In particular, the evaluation of an 
odd function at a point of Mo always yields 0. This also means that any map from 
an ordinary manifold, that is, a supermanifold of dimension m|0, into one of dimen- 
sion O|n vanishes identically. This can be remedied through the functor of points 
approach to supermanifolds. For a supermanifold S, an S-point of a supermanifold 
M isa morphism S$ — M. This construction is functorial in the sense that a mor- 
phism w : T > S induces a map from M(S), the set of S-points of M, to M(T) via 
mt mow. Similarly, a morphism f : M — N induces fs: M(S) > N(S), again 
functorially in S. In order to understand this more abstractly, we consider the so- 
called superpoints, the supermanifolds R®!” define as the space with structure sheaf 
R[o!,..., ”] (with anticommuting #7, as always). Expressed differently, these are 
the supermanifolds ({*}, A,) where {*} is an ordinary point and A, is a Grassmann 
algebra of n generators. Then, see [93], these superpoints generate the category of 
finite-dimensiona supermanifolds, that is, any such supermanifold is completely 
described by its superpoints. For supermanifolds M, N, one then define (or, more 
precisely, shows the existence of) the inner Hom object Hom(M, NV) satisfying 


Hom(R°!”, Hom(M, NY) = Hom(R®!” x M, N) (1.5.28) 
for all n € N and then also 
Hom(S, Hom(M, N)) = Hom(S x M, N) (1.5.29) 


for all supermanifolds S. In this way, the space of morphisms M — WN also becomes 
a functor: For a supermanifold S$, a morphism M x S$ > N, that is, a morphism 
M — N depending on a parameter in S, is then an S-point of Hom(M, N). The 
morphisms R°!” x M -> N are then the superpoints of the supermanifold of mor- 
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phisms Hom(M, NV) (in contrast to Hom(M, N) which is not a supermanifold, but 
rather the reduced space (see below) underlying the supermanifold Hom(M, N)). 
In that way, we see that there also exist nontrivial odd functions on an ordinary 
manifold, say R, even though their values vanish on all points of R. To be concrete, 
consider § = R®”. R x S then has the sheaf C(R) @ R[v!,..., 0]. Now take 
another space T that is odd like S, with sheaf R[7!,...,7’"]. We consider a map 
w:Rx ST, that is y:R!!" — Rl”, given by 


C°(T) =R{n!,...,7”] > C?(R) @RIv!"..., 9], 


n! — aj (rok 
which we can also write as 
W(t) =al (yor. 
Of course, this vanishes at all points of R = (R x S)o, but nevertheless it is a non- 
trivial morphism. 


In the converse direction, let us take n = 1, i.e., consider S = R°!!, the space with 
sheaf R[] (7 = 0), and a morphism 


S— Mo 
into some ordinary manifold Mo. This is given by an algebra homomorphism 
C™(Mo) > RIV], 
freag(f) +a(fv. 
The homomorphism condition implies firs that 
ay: C°(Mo) > R, 
fre af) 


is an algebra homomorphism, and, as is easily derived, it is therefore given by the 
evaluation at some point x € Mo, that is ag(f) = f(x). Secondly we obtain, using 
the homomorphism condition, 


ao(fg) + ai(fg)0 = (ao(f) + a1 (f)P) (ao(g) + a1(g)?) 
= f(x)g(x) + (f (wai (g) + g@)ar(f))8, 


which means that a, is a derivation over functions, that is, the derivative in the 
direction of some tangent vector v, € T, Mo, 


a(f) =oxf. 


Thus, we could view the super point S with its sheaf R[#] as an abstract (odd) 
tangent vector. The maps S — Mp correspond to points in the tangent bundle T Mo. 
If M is a general supermanifold, the same applies, except that we get a sign from 
the odd functions, that is, 


ai (fg) =ai(f)g(x) + (-L)?™ Ff (xJar(g). 
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Thus, a, is an odd homomorphism from the local ring at x to R. 

In any case, we have a projection M — M, from a supermanifold to its reduced 
manifold M,. Conversely, by Batchelor’s theorem, any smooth supermanifold is 
(non-canonically) isomorphic to one of the form (M,, A*V). Thus, we can obtain 
M from the smooth ordinary manifold M, and a locally free module V over the 
sheaf C°(M,); namely, M can be obtained as M, with the sheaf A*V graded by 
the exterior degree mod 2, and the inclusion of C°(M,) into A*V define a mor- 
phism M — M, that retracts the embedding of M, into M. It is important to realize 
that these constructions are not canonical, since they are not invariant under auto- 
morphisms of M if m > 1,n > 2. Namely, simply consider R!!? with coordinates 
(x, 0!, 9) and the automorphism 


(x, 31, 8) @+o!d?, 2, 9). 


This example also shows us that decompositions of functions according to their 
degree are not invariant under automorphisms, and thus not invariant under coor- 
dinate transformations. Namely, if we have a function f that, in the coordinates 
(x, 0! 8), only depends on x, and if we denote its expression in the new coordi- 
nates (x + 0!92, o!, 97) by g, we have 


f@=eet+o!9?, 2!, 9) 
= go(x) + gg(x)0' 8? + gi(x)o!0?. 


Here, we have used the rule (1.5.20) for the Taylor expansion for a function gg of 
the even coordinates, and we then need to add a counter-term gj(x) = —g5(x) in 
order to compensate for the #!9* term from the Taylor expansion of go. Note that 
here we work over a trivial base S. 

If we are Taylor-expanding functions as explained, then if Mo is an ordinary 
manifold, that is, a supermanifold with odd dimension 0, and if we consider a map 
fs: Mo x S — N, then the odd dimension of S determines the maximal degree oc- 
curring in that expansion of fs. In the physics literature, one expresses this by fixin 
the number N of Grassmann generators. In the present framework, this corresponds 
to the odd dimension of S. 

One should also note that a super vector space W = Wo @ W| is not a superman- 
ifold, unless the odd part W, is trivial. If the even and odd part have dimensions m 
and n, resp., then W has the underlying structure of an m + n-dimensional ordinary 
vector space, whereas the ordinary manifold M, underlying an (m|n)-dimensional 
supermanifold is only m-dimensional. Of course, one can canonically construct a su- 
permanifold from a super vector space, but as such, the two structures of a super 
vector space and of a supermanifold are different. 

A super Lie group is a supermanifold that is functorially characterized by the 
property that for all supermanifolds 5S, Hom(S, M) is a group such that the group 
operations are smooth morphisms of supermanifolds. It can be obtained by exponen- 
tiation from a super Lie algebra. More precisely, however, for that exponentiation, 
we also need to be able to multiply the elements of the super Lie algebra by the odd 
variables #/, that is, on the super Lie algebra, we also need the structure of a left 
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supermodule over the algebra spanned by the 3/.'* For a super Lie group H, we 
can consider left multiplication by an element h 


Ly: H—> dH, Lyr(k) =hk forke H. (1.5.30) 
This induces a map (Lj), on the vector field on H, given by 
((Ln)aX)F := X(FoL,) for functions F. (1.5.31) 


When (Ly;),X = X for all h € H, the vector fiel is called left-invariant. The left- 
invariant vector field then span a super Lie algebra (with the graded commutator of 
vector field as the bracket) that is also a super module over the odd variables. 

To see the principle, we consider R'!! with coordinates t, . This space carries 
a super Lie group structure given by 


3650 25=C +74 0107, 0' +0), (1.5.32) 
The translation in the t-direction is generated by the vector fiel 
a 
nf := — J, 1.5.33 
(= 3) (1.5.33) 
the one in the 7-direction by 
D:= 05 — ¥0;. (1.5.34) 


We note that D does not induce a morphism in our sense as it changes the parity. 
We have the relation 


[D, D] =2D* = —24. (1.5.35) 


(D, 0;) constitute a basis of the left invariant vector field on the super Lie group, 
while (Q := dy + 10;, 0;) is a basis for the right invariant ones. We shall meet these 
vector field when we consider supersymmetry transformations. The important point 
is that they generate diffeomorphisms of the superspace R!!!, 


Remark The treatment of supermanifolds presented here has been developed by 
Leites [76], Manin [79], Bernstein, Deligne and Morgan [24], and Freed [39, 40]. 
Another reference is [102]. The comprehensive presentation of the subject is [11]. 
The superdiffeomorphism group is investigated in [94]. 


1.5.3 Super Riemann Surfaces 


As an example, we now consider super Riemann surfaces (SRSs). While above, 
we have define supermanifolds over R, it is straightforward to develop the same 
constructions over C. An SRS then has one commuting complex coordinate z and 


'8We only have a supermodule instead of a super vector space because that algebra is only a ring, 
but not a feld. 
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one anticommuting one 2. In addition, the coordinate transformations are required 
to be superconformal. To explain this, we start with the coordinate transformation 
formula for a single supercomplex manifold M of complex dimension (1|1), which 
has to be even, that is, of the form 


£ (1.5.36) 
v= vh(z) 


with holomorphic functions f and h where f is required to have a nonvanish- 
ing derivative, that is, to be conformal. The structure sheaf is thus of the form 
Ou =0Om.o®Owm.1 where Oy 0 is the sheaf of holomorphic functions on the under- 
lying Riemann surface M, and Oy, 1s a sheaf of locally free modules of rank 0|1 
over Oyo. Up to a change of parity, this then define a line bundle L over M,, and 
conversely, given such a line bundle L over M,, changing the parity of its sections 
from even to odd then define the structure sheaf of supercomplex manifold of di- 
mension (1|1). Thus, such (1|1)-dimensional supercomplex manifolds and ordinary 
Riemann surfaces with a line bundle L stand in bijective correspondence. When we 
look at families of such supercomplex manifolds, however, we may also take base 
spaces with odd directions, and we have the more general transformation formula 


z= f@)+0k@), 
B= g(z) + OA(z) 


with holomorphic functions f,k, g,h and f again conformal. 

In order to defin a super Riemann surface, we require in addition that the struc- 
ture be superconformal. This means the following: We look at the derivative opera- 
tors 0, and tT := dy + ¥d;; they satisfy 


1 


(1.5.37) 


s[t. t= r=d. (1.5.38) 

We have the transformation rule 
T= (tO)E + (tZ — D)Z. (1.5.39) 

(To see this, one computes 

0, = (f, + 0kz)0z + (g, + Bhz)d5, (1.5.40) 
d9 = hos + kdz, (1.5.41) 
To =h4+ 0g, (1.5.42) 
tZl=k+0f, (1.5.43) 


from which 
T = dy + 00, = (h+ 082)95 + (K+ Of;)9: 
= (ht 0g,)(85 + 09s) — (g + PA)(A+ Vg,)d¢ + (k + Of): 
= (td)E + (tZ—B1dD)7? (1.5.44) 
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which is the required formula.) In the same manner as for an ordinary Riemann 
surface, that is, one with transition functions z = f(z), the holomorphicity of f 
implies that 0, is a multiple of 0;, 0, = 0, fd;. We now require for an SRS that 
t transforms homogeneously, that is, t is a multiple of tT. In view of (1.5.39), for 
a family, this leads to the transformation law 


Z= f(z) + 0g(z)h(z), 


. (1.5.45) 
0 = g(z) + BA(z) 


with 
h? (z) = 9; f (2) + g(z)8z8(2). (1.5.46) 


Here, f(z) isa commuting holomorphic function with i f #0, 1e., f is conformal, 
and g(z) is an anticommuting one. 

These transformations then leave the line element dz + 0 dv invariant up to con- 
formal scaling. (The conformal factor is af (z) + ge g(z), and one has to use 
(1.5.46).) 

Given a single SRS &, we can put all the g = 0 and obtain the transformation 
rules 


z= f(z), 


- (1.5.47) 
0 = th(z) 


with h?(z) = 0, f(z). The holomorphic transformation functions f of z defin an 
ordinary Riemann surface &,, but the transformations of the odd coordinate 3 ad- 
ditionally require the choice of a square root h(z) of z f(z). In other words, they 
determine a spin structure on ,. If p is the genus of ©, we have 27? different spin 
structures on &,. In particular, we see that the super Teichmiiller space of super 
Riemann surfaces of genus p has at least 27? components (this does not hold for the 
super moduli space, because modular transformations can mix the spin structures). 
By the Riemann—Roch theorem (stated in 3 of Sect. 1.4.2 and recalled below), the 
number of even moduli (over C) minus the number of conformal transformations of 
x; is 3p —3 while the number of odd moduli minus the number of odd superconfor- 
mal transformations is 2p — 2. (The even moduli here can be identifie with sections 
of K*, where K is the canonical bundle of the underlying Riemann surface, while 
the odd ones correspond to sections of K>/?. The Riemann-Roch theorem says that 
the space of sections of a line bundle L over a Riemann surface & of genus p has 
dimension 


h(x, L) =degL — p+1+h°(x, K @L"!) (1.5.48) 


and the degree of the canonical bundle is 2p — 2.) 

On a sphere, we have no nontrivial spin structures and no super moduli, but, in 
agreement with the Riemann—Roch theorem, the superconformal transformations 
are of the form f(z) = — 7 (with the normalization ad — bc = 1), g(z) = vert 
that is, 3 even and 2 odd parameters. 
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More generally, on the supersphere, when, instead of (1.5.47), we allow for the 
general type of coordinate transformations (1.5.37), we obtain the orthosymplectic 
group OSp(1/2): 


a ba 
T=|c d By], 
y 6 ¢t 


a,b,c,d,t even (commuting), a, B, y, 6 odd (anticommuting). 
T'KT =K_ (T™“ supertransposed), 


with the orthosymplectic form 


0 1 0 
K={-1 0 0 
0 oO 1 
The transformation 
az+b+av yzt+é64+tv 
Re ———_—__, KB —______ 
cz+d+ pv cz+d+ pv 


leaves the line element dz + % dv invariant up to conformal scaling. 
In that case, just to see some formulae, 


dz 
dz zs 
(cz +d+ BY) 
Z12 
Z12 =Z Zz ape ed , 
ee eee Cex +d + BO N(CZa +d + BY2) 
0 
nips. 
cz+d+ po 


Obviously, this extends the operation of S/(2, C) to the super case. 
The supersphere can be covered by two coordinate patches, with transition 
| ~ iv 
L£= =3 a = —. 
Zz Zz 
(Cf. (1.5.45): Here, h = ,/2 f.) 
Genus | is next. A torus with a spin structure is described by the rigid super 
conformal transformation 


(z,0)=(c+1,mv)= (e+, 20), 


where T is taken from the usual period domain, and the n; = +1 determine the spin 
structure. Since the only holomorphic functions on a torus are the constants, we 
obtain a nontrivial supermodulus v only in the case of a trivial spin structure, that 
is, 7} = 1 =7np. In that case, the periodicities are 


ZWHSZ+LASc+t+v0v,0 4+). 
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In agreement with Riemann—Roch, we then also have the odd superconformal trans- 
formation given infinitesimall as 


(z, 0) (c+0eE,0 +68). 


In particular, we see that there can exist nontrivial odd moduli, and the supermoduli 
space is bigger than just the moduli space of ordinary Riemann surfaces with spin 
structures. We observe, however, that ¢  —eé is a superconformal transformation, 
and so the supertori corresponding to e and —e are equivalent. Thus, the correspond- 
ing component of the supermoduli space is a Zz super orbifold, with a singularity at 
e=0. 

When we look at functions on this super torus, we obtain the periodicity condi- 
tion 


fZAN=fEtt+ ov, +), 
that is, after Taylor expanding, 
fo(z) + iP = foe +t) + foz+t)0v + fiZ+7)0 +v) 


which implies that f} vanishes when v ¥ 0, that is, fo is constant (over a trivial base 
S again). f1 is less trivial. The situation becomes richer when we look at mappings 
between two such supertori, with moduli (t, v) and (7, ¥), resp. We then expand to 
obtain 


fo) + E+ Ai(Ded = fo(ztr)+ foietr)ov 
and 
Ai@v+veficctrOo+yv). 


The firs equation expresses fj in terms of f| or conversely, while the second one 
restricts f|. However, we should be careful here as fp need not be holomorphic, and 
so fj stands for a (2 x 2)-matrix. 


Remark Fora treatment of super Riemann surfaces as needed for superstring theory, 
we refer to Crane and Rabin [21, 89] and Polchinski [88]. A general mathematical 
perspective is developed by Leites and his coauthors in [32]. A very lucid discussion, 
which we have also partly utilized here, can be found in [93]. 


We should note that the above definitio is not the only possible for an SRS. In 
fact, there are several superextensions of the conformal algebra, and each of them 
could be taken as the basis for the definitio of an SRS. The one used here corre- 
sponds to the superconformal algebra €/(1|1) and yields the N = 1 worldsheets of 
superstring theory and 2D supergravity. 


1.5.4 Super Minkowski Space 


Now assume that we have a vector space V with a quadratic form Q and a represen- 
tation of the Clifford algebra C/(Q) for which a symmetric equivariant morphism I" 
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as in (1.3.25) exists. Following the presentation in [22], we may then construct an 
object that captures deeper aspects of the physical concept of supersymmetry than 
just a supermanifold, namely a space that incorporates the symmetry between vec- 
tors and spinors as representations of bosons and fermions, resp. For that purpose, 
we consider the vector space V as the Lie algebra of its translations, and construct 
the super Lie algebra 


(:=Ves* (1.5.49) 
and the bracket [., .]. This bracket is trivial on V (that is, V is central) and is given 
by 

[s,t] = —20(s,t)eV (1.5.50) 
on S*. Super Minkowski space M is then define as the supermanifold underlying 
the Lie group exp(I); its reduced space is thus given by the affin space V, and its 
odd directions are given by S”*. 


In the Minkowski case, the super Lie algebra (1.5.49) leads to the super Poincaré 
algebra 


(V @s0(V)) ® S*. (1.5.51) 
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Chapter 2 
Physics 


2.1 Classical and Quantum Physics 


2.1.1 Introduction 


In this section, we will describe some important principles at a heuristic level. We 
hope this will be useful as a guide to some of the sequel which is more formal, but 
whenever the meaning of this section appears unclear, the reader should proceed 
to the more formal treatment below. There are many textbooks available on the 
mathematical aspects of quantum mechanics, for instance [53]. 

Classically, a particle is represented as or described by a point in some state 
space M. It moves in time along some trajectory x(t) that is a solution of a system 
of second-order ODEs (a dot denoting a derivative with respect to time f), 


K(t) = f (x, %) (2.1.1) 


that is derived from an action principle. This principle consists in minimizing the 
Lagrangian action 


S(x) = [ Fow,i@par, (2.1.2) 


the integral w.r.t. time over some Lagrangian that is a function of x and its firs tem- 
poral derivative.! As will be discussed in more detail in Sect. 2.3.1 below, a mini- 
mizing x(t) satisfie the corresponding Euler-Lagrange equations 


ddF dF _ 0 (2.1.3) 

dtdxk dx ~ = 
Here, the space M is d-dimensional, and in local coordinates x = (x!,.. #4). 
Alternatively, one may utilize the 2d-dimensional phase space N with coordinates 
(xt, 2.2, x4, x9! =i! 12. 2d = x4), 


oF oF 


alee? cron When one intro- 


ue stands for the covector of partial derivatives ( 
duces this covector as a new variable, that is, puts 


pi=—, ie, pps (2.1.4) 


'We consider here only the autonomous case; in the non-autonomous case, the density may also 
explicitly depend on t, F(t, x(t), x(t)), and not only implicitly through its dependence on x(t) 
and x(t). 
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one arrives at the Hamiltonian formulation. This involves the Hamiltonian 


_adF 
A(p,x):=x—-F (2.1.5) 
dx 
where xo =) xd a4 =) x/ p;. A solution is then obtained from the Hamilton 
equations , 
; dH . aH (2.1.6) 
=—-—_, x=—. wl. 
7 dx dp 


The Hamiltonian formalism singles out time and is therefore not relativistically in- 
variant. Consequently, in our treatment of QFT, we shall mainly employ the La- 
grangian formalism. 

The standard example is 


Fa? -—Vvo) (2.1.7) 
=" ae 


where m is the mass of the particle and V the potential. The Euler-Lagrange equa- 
tions (2.1.3) are then 


dV 
¥ = —— 2.1.8 
mx Fe ( ) 
(in components: mx! = — ar). The Hamiltonian is then 
2 
=a ae 
H= |x + V(x) = + VQ), (2.1.9) 
2 2m 
and (2.1.6) becomes 
dV 
iS, SS (2.1.10) 
dx m 


For a solution (x(t), p(t)) of (2.1.6), we can then also compute the time evolution 
of any function A(x, p) via 
dA <dA,, dA... JOADH dJIAOH 


=—*'4+— p=— -=: {A, H}, 2.1.11 
dt axi api? Ox! Opi = Op; Ox! ( ) 


where the last expression is called the Poisson bracket. It satisfie all the proper- 
ties of a Lie bracket, as well as the canonical relations (Heisenberg commutation 
relations) 


{x',x/} =0={pi, pj} and {x', pj} =6'. (2.1.12) 


Equation (2.1.11) is obviously a generalization of (2.1.6) (in the sense that <! = 
{x!, H}, p ; = {pj;, H}), and it also tells us that conserved quantities, that is, time- 
independent quantities, are precisely those whose Poisson bracket with the Hamil- 
tonian H vanishes. 
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Quantum mechanics was discovered by Heisenberg and developed with Born and 
Jordan as the description of quantum theory through the correspondence with clas- 
sical mechanics via matrix algebra. We now describe this, employing more modern 
terminology, of course. Quantum mechanically, in place of a point x in M, we have 
a probability distribution |p (x)|? derived from a function @ : M — C with 


lolli (- [ JP s)avorts)) =i: (2.1.13) 


| (x)| can thus be interpreted as the probability density for findin the particle un- 
der consideration at the point x. Here, for the L?-norm, we need a volume form dvol 
on M; that volume form could come from a Riemannian metric. The classical case 
is recovered as the limit where this probability distribution becomes concentrated at 
a single point, that is, a delta function(al). In quantum mechanics, the observables 
are self-adjoint operators on the Hilbert space 1( := L*(M, C). As self-adjoint op- 
erators, they have a purely real spectrum. The eigenvalues corresponding to eigen- 
states of such an operator then represent sharp observations. These operators, how- 
ever, are typically unbounded which leads to certain mathematical difficulties as 
will be described in more detail in Sect. 2.1.3 below. 

In the formalism of canonical quantization, the momentum p; becomes the op- 
erator ne The total energy, the Hamilton function above, thus also becomes an 
operator, the Hamiltonian operator H, and the state @ evolves in time ¢ according to 
the Schrédinger equation 


r) ft 
ih oe 1 Se cea, (2.1.14) 
For the Lagrangian (2.1.7), the Schrédinger equation (2.1.14) becomes 
ad(x,t h 
noe =-5 Abe. N+ V)O.0. (2.1.15) 
m 


ot 
The ansatz (x, t) = (x) exp(— f Et) of separated variables leads to 


he 
7 AO) + V(x)O(x) = EQ), (2.1.16) 


the time-independent Schrédinger equation. 
We can arrive at (2.1.15) from the ansatz of representing @(x, t) as a wave: 


1 Bal 
00.0 = sap exp 7 Pv Et)=: (x, t|p, E), (2.1.17) 


where we have already introduced Dirac’s notation to be explained below. Then 


; 
~_$(x,) = — Pi P(E.1), (2.1.18) 


ox” 
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) i 


On the basis of this computation, we then put 


—=-);, 2.1.20 
ax” APs ( ) 
a i 
—=-_—E. 2.1.21 
ot h ( ) 


For the Hamiltonian (2.1.9), we are then naturally led to (2.1.15), and with the ansatz 
P(x, t) = (x) exp(—¢ Er) at (2.1.16). 


Remark We use here the so-called Schrédinger picture where the states ¢ are evolv- 
ing in time. In the complementary Heisenberg picture, instead the observables, rep- 
resented as self-adjoint operators A, evolve according to 

dA 


ih =[A,H 2422 
Ma ee); ( ) 


in analogy to (2.1.11), see (2.1.92), (2.1.93). 


In the quantum mechanical view, the fiel @: M — C is obtained from the quantiza- 
tion of a point particle. There is, however, another interpretation of @ that turns out 
to be more fruitful for our purposes. Namely, we can view ¢ also as a classical fiel 
on M. It then need no longer satisfy the normalization ||@|| 2 = 1. Also, it need no 
longer take its values in C only, but it can also assume values in the fiber of some 
vector or principal bundle or some manifold. As a classical situation, it can then be 
quantized again, and one then speaks of a second or fiel quantization. The analog 
of the Schrédinger equation is then a PDE on some function space, that is, a PDE 
with infinitel many variables. 

There is an important generalization of this picture: When the particle possesses 
some internal symmetry, described by some Lie group G, the space C gets re- 
placed by a (Hermitian) vector space that carries a (unitary) representation of G. 
Thus, a particle is described by some  € L?(M, V), again of norm 1, so that || y||? 
(where ||.|| is the Hermitian norm) can again be interpreted as a probability density. 
The vector space V enters here in order to distinguish different states that are not 
G-invariant, as G leaves the space V invariant, but not the individual elements of V. 
This is needed because not all physical forces will be G-invariant. An example is the 
electron with its spin. Since there are only two possible values of the spin, here the 
vector space is finite Z2, and the corresponding Hilbert space is finite-dimensional 
C*. Quantum electrodynamics (QED) then couples the Maxwell equation with the 
Dirac equation for the electron spin on a relativistic space time. The standard model 
of elementary particle physics interprets the observed multitude of particles through 
symmetry breaking from some encompassing Lie group G that contains all the sym- 
metry groups of the individual particles. Of course, we shall explain this in more 
detail below. A quick and useful introduction to the topics of this section can be 
found in [90]. 
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2.1.2 Gaussian Integrals and Formal Computations 


Before proceeding with quantum physics, we introduce a basic formal tool, 
Gaussian integrals, that serve as a heuristic transmission line from finite-dimensiona 
exponential integrals to infinite-dimensiona functional integrals. 

We start with the bosonic case. Let A be a symmetric n x n-matrix with eigen- 
values 


Ai > 0; (2.1.23) 
and let b be a vector. 
The Gaussian integral (x = (x!, ...,X")) is 
i 1m (my)? 
I(A) = [ en(-> Axa seedx" = (=) (2.1.24) 


with det A = [/_, 4;, as follows easily by diagonalizing A. A formal extension of 
this formula to infinit dimensions is often based on expressing the determinant of 
A in terms of a zeta function; we defin the zeta function of the operator A as 


n 


1 
ta(s) = Dogg! fors €C. (2.1.25) 


Since A," =e7* logs we obtain for the derivative of the zeta function 


: ua logrAx 
= ) . 2.1.2 
o4(S) ra ou ( 6) 


Therefore, we can express the determinant of A in terms of the derivative of the zeta 
function at 0: 


n 
det A= [aj =e. (2.1.27) 


i=1 


The general Gaussian integral 
1 
I(A, b) = [ ax! -dx"exp(— jx" vt Ox) (2.1.28) 


is reduced to this case by putting 


xi= A lb+y 
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(note that x9 := A~'b minimizes the quadratic form 5x Ax — b'x). Namely, we 
obtain 


1 1 
1(A,b) =exp( 5614-6) f exp(—Fy'4 yay’ . dy" 


1 
ta a \ (my? 
=exp(50'4 (Sr) 


1 ; 
= (2n)"/? exp( 5o'A'n eH, (2.1.29) 


using (2.1.27) for the last line. 
In many cases, the vector b has an auxiliary or dummy role. Namely, we wish to 
compute moments 


fxit-+-alm exp(—3x'Ax)dx!.--dx" 


xil ae . xm : 
fexp(—4x! Ax) dx!..-dxn 
1 a F 
= TA) abn Ryn 1A» Plo=0- (2.1.30) 


In particular, the second-order moment or propagator is 
(vix/) = (AM), (2.1.31) 


When mm is odd, the moment (2.1.30) vanishes because the (quadratic) exponential 
is even at b=0. 
For even m, we have Wick’s theorem 


(xl eee ly = >. (4~") 
all possible 
pairings of 
(i, ,..:4m) 


(47) 


(2.1.32) 


Tp) lp lpm lpm 


as follows directly from (2.1.29) and (2.1.30). 

As a preparation for the functional integrals to follow, we now wish to consider 
x! as an operator on the finite-dimensiona Hilbert space E” = R” with its Euclidean 
product. We then have the matrix elements 


(ej|x"! ---x"™Ie;) 


2 


f xt ---x'm 8 (x! — 18d — 1) exp(—4x! Ax) dx! ---dx" 
7 fexp(—4x! Ax) dxl...dx” 


Instead of the 5-functions, one can then also make arbitrary insertions into the func- 
tional integral, that is, functions of x. 
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In the Grassmann case, we start with a Grassmann algebra generated by 


Re a | rte Tig 


Jg( A) = if dn! dij! -- «dn! dij" exp(—ii' An) 
n n . : 
= fan ai! dh ai" TP] (= a Ayn’) 
i=1 j=l 


= 0° sign(p)Aipay-+ Anpiny 


permutations p 


= det A. (2.1.33) 
We next compute, for a Grassmann algebra generated by 7 1, oF", 
1 
J(A) = [ao “edo exp(-39'0). (2.1.34) 


We may assume that A is antisymmetric, as the symmetric terms cancel because the 
’s anticommute: 


way= f ao! ---do"T]( — 9'Ajj9/) 
Lay 
= a sign(P)A p(1) p2) Ap@)p4) *** Ap(n—1) pin) 
permutations p 
with p(2i—1)<p(2i), 


pQi-1)<p(2i+1) 
for i=1,...,n, or n—1, resp. 


=:Pf(A) (Pfaffian . (2.1.35) 


We have 
1 
P¢ay= f ao! ---d9™ ao" -.9™ exp(—3(0"Av +9409), (2.1.36) 


The coordinate transformation 


k 1 k gk 
= —(v v), 
n Fl +1 ) 
-k 1 k egik 
= —(v v), 
‘ vai ee, 


has the Jacobian (—1)" and satisfie v'0/ +99 = nini — nin’. 
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Using the antisymmetry of A, we obtain 


P(A) = f diy dil ---dy" di exp(—H! An) 
=det A (2.1.37) 
from (2.1.33). From (2.1.34), (2.1.35), (2.1.37) we see 
Pf?(A) = det A, (2.1.38) 
that is, 
J(A) = (det A)?. (2.1.39) 


As in the ordinary case, we also have 


1 
J(A,b) = | do!.--dd?" exp( —=0' Ad +b'0 
PL) 


= (Ayexp( 501 4~'b) (2.1.40) 


and likewise 


a 1 0d 40 
igly — =(A7!),, 


Another formal tool that is useful in this context are formal computations with 
Dirac functions. In the physics literature, linear functionals on space of functions are 
systematically expressed by their integral kernels. Thus, the evaluation of a function 
gy at a point y 


gr gly) (2.1.41) 


is written in terms of the Dirac 6-functional 
g(y) = [ depo — y) =d,(g). (2.1.42) 


If we change the variable z = f(w), this becomes 


ey) = / ae 


This formula is useful for calculating y(f(w)) at f (w) = y, without having to solve 
the latter equation explicitly. 
In analogy to (2.1.42), we also have 


0 
act “7 locrowy sc ftw 7 (2.1.43) 


F) a a 
pieor= | deg o@se—»=— f azo 8e-y) (2.1.44) 
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if we formally integrate by parts. Thus, we can defin the derivative qr —y) of 
the delta function 6(z — y) as the functional 


a 
gr rer ec (2.1.45) 
y: 
We now consider a functional ® 


pr> &&) 


define on some Banach or Fréchet space B. The Gateaux derivative 6® in the 
direction dg is define as 


5D 
56()(89) = 2 


1 
= ee ate) (2.1.46) 


provided this limit exists. Here, dg is, of course, also assumed to be in B. When the 
limit in (2.1.46) exists uniformly for all variations dg in some neighborhood of 0, 
we speak of a Fréchet derivative. Some formal examples: 


5p(f)" (59) = ndy(f)o( fy". (2.1.47) 
de?) (Sp) = dy(fye®™. (2.1.48) 


We are usually interested in Lagrangian functionals, 


L(u) = [ FG.ue), due de. (2.1.49) 


L is usually define on some Sobolev space of functions. We then have 


AO = & [ PGue) +s5u(8), dQu(G) +58uG))) dB 0. 
(2.1.50) 
The question arises as to which variations 6u one may take here. One class of vari- 
ations is given by the test functions, that is, the functions from the space D := Cj° 
of infinitel often differentiable functions with compact support. That space is not 
a Banach space, but only a limit of Fréchet spaces with topology generated by 
the seminorms | f|x,« ‘= sup,¢ lp" Ff (x)| for nonnegative integers k and compact 
sets K. Its dual space D’, that is, the space of continuous functionals on D, is the 
space of distributions. The best-known distribution is of course the Dirac distrib- 
ution already displayed in (2.1.42) above. Conceiving the Dirac functional as an 
element of D’, that is, as an operation on smooth functions (in fact, continuous 
functions are good enough here), is the Schwartz point of view. A different point of 
view, which does not need topologies with unpleasant properties (for example, the 
implicit function theorem is very cumbersome in Fréchet spaces) and is more useful 
in nonlinear analysis, is the one of Friedrichs, which considers the Dirac function as 
a limit of smooth integral kernels with compact support that in the limit shrinks to 


dL (u)(6u) = 
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a point. One then, in effect, never needs to carry out any formal manipulation with 
the Dirac function(al) itself, but only ones with such smooth integral kernels. The 
Dirac point of view, finall , simply performs formal operations with the Dirac func- 
tion. Thus, the different approaches consist in justifying, avoiding, or performing the 
Dirac function computations. The Dirac approach is prominent in the physics litera- 
ture, and we shall also follow that here, because we are assured that these operations 
can be made mathematically rigorous by either of the other two approaches. 

Having said that, we then also take functional derivatives in the direction of Dirac 
functions. That means considering 


oD 
5p(z) 


For (2.1.49), (2.1.50), we then have the formal relation 


1 
= fim (PO) +18 — Zz) — P(p(y)))- (2.1.51) 


6L 


dL (du) = i d&du(é) ; (2.1.52) 
du(é) 
with the consistency relation 
a= f dge- 9 (2.1.53) 
= z—&)——. Al. 
du(z) bu(é) 
Thus, sony measures the response of L to a change in u supported at z. 
We also have 
6 
— 9(x) = d(x — 2) 2.1.54) 
59(z) 
and 
é n n—1 
dxg" (x) =ng"*(z). (2.1.55) 
59(z) 


Looking at (2.1.42), (2.1.51), (2.1.54), we see that the operations with the Dirac 6- 
function are simply formal extensions of the ones with the Kronecker symbol in the 
finite-dimensiona case. 

In the Grassmann case, the Dirac 5-function is 


60)=0= if dnexp(n?) (2.1.56) 
satisfying 


[ass = 10 for f (8) = f (0) tad. (2.1.57) 


For more details on the formal calculus, see [114]. 
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2.1.3 Operators and Functional Integrals 


In this section, we want to amplify the discussion of Sect. 2.1.1 and introduce 
path integrals. We want to investigate the time evolution of a quantized particle. 
This is described by a complex-valued wave function ¢(x, t) whose squared norm 
|p (x, t)|? represents the probability density for findin the particle at time ¢ at the 
position x € M. Here, #(x, t) is assumed to be an L?-function of x so that the total 
probability can be normalized: 


i (x, )Pdvol(x) = 1 (2.1.58) 
M 


for all t. For a measurable subset B of M, the probability for findin the particle in 
B at time ¢ is then given by 


/ |p(x, t)Pdvol(x). (2.1.59) 
B 


More abstractly, a pure state |y) of a quantum mechanical system is a one- 
dimensional subspace, which we then represent by a unit vector 7, in some Hilbert 
space J. The scalar product is written as (¢| Wr); here, by duality, we may also con- 
sider (#| as an element of the dual space J{*. For a pure state w, we let Py, be the 
projection onto the one-dimensional subspace define by 7. As a projection, Py, is 
idempotent, that is, P? = Py,. Then 


lo, WP = (Pov, w) = tr Py Py (2.1.60) 


is the probability of findin the system in the state @ when knowing that it is in the 
state w. Let us assume that for some map T on the states of H{, we have 


\(T¢, TWP =b, WI? (2.1.61) 


for all ¢, w, that is, the probabilities are unchanged by applying T to all states. By 
a theorem of Wigner, 7 can then be represented by a unitary or antiunitary operator 
Ur of H, that is Ty = Ur for all w.? 

The observables are self-adjoint (Hermitian) operators A on KH, typically un- 
bounded. Being self-adjoint, their spectrum is real. The state |y) then also define 
an observable, the projection Py,. The expectation value of the observable A in the 
state |W) is given by 


(Wy, Ay) =trAPy (2.1.62) 


7In particular, connected groups of automorphisms G of H are represented by unitary transforma- 
tions of }(—with the following note of caution: Uy is determined by T only up to multiplication 
by a factor of norm 1. Therefore, in general, we only obtain a projective representation of G, that is, 
we only obtain the group law Ugh = c(g,h)UgUp for some scalar factor c(g, h) of absolute value 
1. It is, however, possible, to obtain an honest unitary representation by enlarging the group G. 
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(assuming that w is contained in the domain of definitio of A). This includes 
(2.1.60) as a special case. When yw is an eigenstate of A, that is, 


Ay =a (2.1.63) 


for some (real) eigenvalue A, then this A is the expectation value of A in the state wy. 
The variance of the probability distribution for the observations of the values of A 
for a system in state w is then 


(wy, A°w) — (W, Av)’. (2.1.64) 


This variance vanishes iff (2.1.63) holds, that is, iff w is an eigenstate of A. That 
means that an observable A takes precise values precisely on its eigenstates. 

Let G be a group, like SO(3) or SU(2), acting on 1 by unitary transformations. 
An observable A is called a scalar operator when it commutes with the action of G. 
Then, if y is an eigenstate of A with eigenvalue 4, for all g € G, 


Agw = gAv = ghW=Agy. (2.1.65) 


Thus, the space of eigenstates with eigenvalue A is invariant under the action of G. 
As such an invariant subspace, it could be reducible or irreducible. In the latter case, 
the degeneracy of the eigenvalue 4 equals the dimension of the corresponding irre- 
ducible representation of G. These dimensions are known by representation theory, 
see [45, 75]. If one then perturbs the operator A to an operator A’ that is no longer in- 
variant under the action of G, the multiplicity of the eigenvalue A will decrease. This 
is important for understanding many experimental results. The operator A might be 
the Hamiltonian Ho of a system invariant under some group G, say of spatial ro- 
tations. Its eigenvalues are the energy levels, and because of the invariance, they 
are degenerate. Ho then is perturbed to H = Ho + H\ by some external magnetic 
fiel in some direction which then destroys rotational invariance. Then the energy 
levels, the eigenvalues of the new Hamiltonian H split up into several values. Often 
H is small compared to Ho, and one can then approximate these energy levels by 
a perturbative expansion of H around Hp. 

We return to the general theory. When the spectrum of A is discrete, and |a) runs 
through a complete set of orthonormal eigenvectors of A, we have the relation 


Y- la) (al =id, (2.1.66) 
la 


the identity operator on 7. Applying this to |y) € H yields 


Yo layaly) =|y). (2.1.67) 
|a) 


which is simply the expansion of |¥) in terms of a Hilbert space basis. When the 
eigenvalue of A corresponding to the eigenstate |a) is denoted by a, that is, Ala) = 
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al|a), we have the relationships 


Alw) = 7 Ala) (aly) =) ala)(aly) and (2.1.68) 


a) 


(PIAL) = 5 (61 Ala) (aly) =e (pla) (aly). (2.1.69) 


a) 


When the spectrum of the operator A is continuous, the sums in the preceding rela- 
tions are replaced by integrals; for instance 


(elAly) = f daatolay(aly). (2.1.70) 


This is rigorously investigated in von Neumann’s spectral theory of (unbounded) 
self-adjoint operators on Hilbert spaces. Let us briefl describe this, referring e.g. 
to [100, 110] for details (the knowledgeable reader may of course skip this). 

A family of projections E(a) (—oo < a < ov) in a Hilbert space H is called 
a resolution of the identity iff for alla,b eR 


1. 
E(a)E(b) = E(min(a, b)), (2.1.71) 
5: 
E(-co)=0, = E(oo) = Id (2.1.72) 
(here, E(—0o)|$) := limg, oo E(a)|), E(0o)|) := limatoo E(a)|o) for |) € 
J), 
E(a+0) = E(a) (2.1.73) 


(E(a + 0)|@) := limp jo E(6)|9)). 


For a continuous function f : R — C, one can then defin 


Ag 
[ f(@) dE(a)|¢) = , i mm is pd. Flor) Blaes) - E(ax))|\6) (2.1.74) 
1 ax |ak41—a, 
for Ay =a, <d2-+++ <a, = Az and ax € (ag, ag+1] (a limit of Riemann sums), and 
[ feat) = ie fladE(a)l¢) (2.1.75) 
Ail— = aie 


whenever that limit exists. This is the case precisely if 


/ | f(a)? dll E(a)|b)|12 <0 (2.1.76) 


where ||.|| is the norm in the Hilbert space H. For such |@), 


ws / F(a) d(WlE(a)|o) (2.1.77) 
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define a bounded linear functional on H. In other words, we have a self-adjoint 
operator G with 


(WIGI¢) = | f(a) d(wlE@)|¢) (2.1.78) 


define for those |@) and all |y) € H. The central result is that every self-adjoint 
operator A on the Hilbert space { admits a unique spectral resolution, that is, can 
be uniquely written as 


A= [ adE(a) (2.1.79) 
in the sense that 


(WiAld) = f aa(WEC@id) (2.1.80) 


for all |¢) in the domain of definitio of A and all |W) € H. This is the meaning 
of (2.1.70). 

On this basis, one can defin the functional calculus for self-adjoint operators 
and put 


f(A) = / f(a) dE(a) (2.1.81) 


for a function f : RR — C when (E(qa)) is the spectral resolution of A. f(A) is 
then also a self-adjoint operator. When f is an exponential function, for example, 
this leads to the same result as definin e4 directly through the power series of the 
exponential function. 

The correspondence between classical and quantum mechanics consists in the 
requirement that the quantum mechanical operators x, p corresponding to position 
x and momentum p satisfy the operator analogs of (2.1.12), that is, 


[t/, £/]=0=[f;, pj] and [%/, py] = indi, (2.1.82) 
with the commutator of operators, 
[A,B] =AB—BA (2.1.83) 


in place of the Poisson bracket. The factor i in (2.1.82) comes from the fact that the 
commutator of two Hermitian operators is skew Hermitian. 

|x) then denotes the state where the particle is localized at the point x € M, ie., 
the probability to fin it at x is 1, and 0 elsewhere. When M is the real line R, that 
is, one-dimensional, this is an eigenstate of the position operator <, that is, 


x|x) = |x)x (2.1.84) 


corresponding to the eigenvalue x. In RZ, the components x',i=1,...,d, are the 
eigenvalues of the corresponding operators <', and 


Ki |x) = |x)x!. (2.1.85) 
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One should note that these operators are unbounded, and in our above L?-space, 
the states |x) are represented by Dirac functionals 5(x), that is, they are not 
L?-functions. Functional analysis provides concepts for making this entirely rigor- 
ous. In fact, according to spectral theory as presented above, we consider the Hilbert 
space L?(R) and write the operator as 


Al) (x) = x1) (x), (2.1.86) 
which then admits the spectral resolution 
A= [ adz@) (2.1.87) 
with 
E(a)|6)(x) = | A ersaiint (2.1.88) 
orx >a. 


Returning to (2.1.84), we see that the spectrum of the position operator x on R 
consists of the entire real line. 

The established notation usually leaves out the hats, that is, writes x both for the 
position at a point in M and the corresponding operator on H that has been called x 
in (2.1.84). We shall also do that from this point on. 

With these conventions, the Schrédinger equation (2.1.14) becomes 


0 
ihe lw) = Aly @). (2.1.89) 


H, the Hamiltonian, here is a self-adjoint (Hermitian) operator, that is, an observ- 
able, in fact the most basic one of the whole theory. 
The solution of (2.1.89) can be expressed by functional calculus as 


lW(t) =e" ht |w(O)). (2.1.90) 


Here the exponential of —H is define through the usual power series of the expo- 
nential function, or better, via (2.1.81). Since H is Hermitian, the operators e nth 
are unitary. Thus, the state |y) evolves by unitary transformations. 

By taking 2 of (2.1.90), we see that formally it satisfie the Schrédinger equation 
(2.1.89), indeed. From (2.1.90), we also infer the semigroup property 


i 


|W) = eT REDE ay (7) = eT REA |e“ KTH (Q)), (2.1.91) 


Thus, the solution at time ¢ is obtained from the solution at time t by applying the 
solution operator for the remaining time ¢ — t (0 < t <r). We express the rela- 
tion between (2.1.110) and (2.1.109) also by saying that —;H is the infinitesima 


generator of the semigroup e~7'4 | For an account of the mathematical theory of 
semigroups for partial differential equations, we refer to [63]. 
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In this Schrédinger picture, the states evolve in time, whereas the observables 
don’t. In the Heisenberg picture, this relation is reversed. The states are time- 
independent, whereas the operators representing the observables change in time, 
according to 


nf! K 1A, HY (2.1.92) 
i rahe wl. 
whose solution is 
A(t) =er#t Ae FH, (2.1.93) 


The Schrédinger and the Heisenberg picture are equivalent insofar as they yield the 
same probability density for the outcome of observations. This is expressed by the 
relation 


(GIAO IW) = (GOer# AOE F#"|yY-(0)) = (POIAMIW(0)) (2.1.94) 


where we have used (2.1.90) and (2.1.93). 
From (2.1.92), we also see that A is conserved precisely if it commutes with H. 
In that case, the quantity in (2.1.94) is constant in time, equaling 


($(0)|AO)|¥ )). (2.1.95) 


Experimental interactions are formally described by the S-matrix. It is assumed that 
a state is prepared to have a definit particle content a for t + —oo (that is, before 
the interaction takes place); this is the in state wy. The interactions take place at fi 

nite time, and one then measures the out state yw, with particle content 6 for t + co 
(that is, after the interaction has taken place). Then, the probability amplitude for the 
transition is 


Spa = (We > Wa )- (2.1.96) 


The Sgq are the components of the S-matrix. It is assumed here that these values are 
computed for complete sets of orthonormal in and out states, so that the S-matrix 
has to be unitary. 

We consider once more the Hamiltonian (2.1.9). x is the position operator, and 
a particle that is located at a point x € M (note that we use the same symbol for 
the position and the position operator) is then in the eigenstate |x) of the position 
operator (note that this eigenstate in general will not be contained in the Hilbert 
space L?(M), but is instead given by a delta functional 6,). If the particle is in the 
state |x) at time 0, then by the solution of the Schrédinger equation (2.1.90), at time 
t it will be in the state 

ent) yy (2.1.97) 

More generally, the probability amplitude (x”, 1” |x’, t’) that a particle starting at x’ 
at time t’ will be at x” at the time ¢” > ¢’ is given by 


(xl txt!) = (eT RO) py (2.1.98) 
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the projection of the state e Raat) |x’) obtained from the solution of the 
Schrédinger equation onto the state |x”). 
By formal functional calculus, this is expressed as a Feynman functional integral 


(x eT HOP) ty / Dx exp( ZL) (2.1.99) 


. t! 
= [ dvexo(= [ (Fir V¢v(e)) dr J2.1.100) 
t=t’! 


for our standard example (2.1.7). Here, Dx symbolizes a formal measure on the 
space of all paths starting at time ¢’ at x’ and ending at time r” at x”, according to 
the interpretation given in many texts. One should point out here, however, that this 
measure by itself is not well defined What one can hope to attach a mathematical 
meaning to is only the entire integrand Dx exp(iL(x)) in (2.1.99) as a functional 
measure on the path space. This is in analogy with the Wiener measure where one 
considers the probability density p(x”, t’’|x’, t’) for a particle starting at time r¢’ at 
x’ and ending up at time ¢” at x” under the influenc of the potential V(x), that is, 
governed by the Lagrangian (2.1.7), 


F=—kKP-V@), (2.1.101) 


The probability density evolves according to the heat equation 


ve) = mA¢(x,t) — V(x) d(x, ft). (2.1.102) 


For comparison, we recall the Schrédinger equation (2.1.14) for the Lagrangian 
(2.1.7), 


2 
pO 2 ni a eng ey, (2.1.103) 
OT 2m 


Let us assume fi = m = | to make the comparison a little simpler. Then, in fact, 
setting tT = —it transforms (2.1.103) into (2.1.102). Thus, the Schrédinger equation 
is the heat equation for imaginary time. With this change of time (called analytic 
continuation or Wick rotation in the physics literature), the corresponding functional 
integrals are also transformed into each other. This is useful at the formal level, but 
perhaps not as much so for the more detailed mathematical analysis. 

Returning to (2.1.102), Wiener then showed that, under appropriate conditions 
on V, the solution can be represented as a path integral 


pea" tal t)= f Deexp(-L0) 


1! 
= [ vvexo(- [ (Fume vite) ar). (2.1.104) 
a 
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For the discussion to follow, it will be convenient to rewrite (2.1.104) slightly as 
p(x", t"|x’,¢’) = oa exp(—L(x)) (2.1.105) 


to indicate the initial and terminal points of the paths over which we integrate. One 
then has the property 


(x,t |x’, t) = (x", t" |x, t) p(x, tx’, dx (2.1.106 
Pp P P 
x 


for every t/ < t < tr”, which simply expresses the fact that every path leading from 
x’ at time t’ to x” at time ¢” has to pass through some x at the intermediate time f. 
Thus, we may cut the path at time ¢ and integrate over all possible cutting points x. 
In terms of functional integrals, this becomes 


poo" tl a't) = f ax [ost exp(—L(x)) f Lx” exp(—L(x)).(2.1.107) 


The difference between (2.1.102) and (2.1.104) is the i versus the —1 in the expo- 
nent in the integral. In the Wiener case, the minus sign leads to a rapid dampening 
of the influenc of those paths with large values of the Lagrangian action, and to 
a concentration of the functional measure near the minimum of the action. In the 
Feynman case, in contrast, paths with large values of the action cause rapid fluctu 
ations in the integral, making the analysis substantially harder, see [2]. We do not 
enter the details here. For more on this, see [49]. 
In analogy to (2.1.106), we have the cutting relation 


(x eR AO) 


= frosty exp( 7209) 
= [ax frosty, exp(5L0) frp” exp( 52009) (2.1.108) 


fort’ <t<t". 
Written more abstractly, this is the analog of (2.1.106), 


x" tx’ t= / dx x", tx, t) (x, t\x’, t’). (2.1.109) 
x 
This fit together well with the operator formalism as in (2.1.70). In particular, we 


can now insert a position operator (writing x(t) in place of x(t) as announced above) 
and compute 


a On = (Dat xmexp( 509) 


= f axe" ates cette’), (2.1.110) 
x 
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Here, the x(f) in the integral is a number,’ not an operator. That number in the inte- 
gral then translates into the operator x(t) in the inner product on the I.h.s. In physics, 
these inner products are viewed as the matrix elements of an infinite-dimensiona 
matrix. 

This process of cutting the path in the integral can be iterated, and we can insert 
two intermediate positions x(t,), x(f2) (or more, but the principle emerges for two 
already), to get 


/ (Dally x(ndxte)exp( F409) = (x0 x(o)x(ix.t). (21.11) 


Here, in the integral, the temporal order of f, and fp is irrelevant because in the 
integral, x(t), x(t2) are real numbers.’ In the r.h.s. of (2.1.111), however, they are 
operators, and since operators in general do not commute, the order does matter 
here. Since the paths are traversed in increasing time, we need to put them into 
the temporal order, that is, always apply the operator corresponding to the smaller 
time first This is called temporal ordering. Formally, one can defin the temporally 
ordered operator 


— Jx()x(n) ifti <h, 
T [x (t2)x(t1)] = set. Fees (2.1.112) 


and write the rh.s. of (2.1.111) as 


(x”,¢”"|TLx(b) x(q) 1x", 7). (2.1.113) 
We may also write 
T [x (t2)x (t1)] = O (2 — 11) x (t2)x (1) + O(t — t2)x(t1) x (12) (2.1.114) 
where 
1 fors>0, 
Og ae ee (2.1.115) 


is the Heaviside function. Considered as a functional, its derivative is the Dirac 
functional, 


# 9g 55). (2.1.116) 
ds 


3More precisely, when the path x takes its values in Euclidean space R?, x(t) is a vector with d 
components. The operations in (2.1.110) and subsequent formulae are to be understood for each 
component. In particular, when we later on, in (2.1.111) and subsequently, insert expressions like 
X(t1)--++X(tm), this is understood as the vector (x! (t))-- -x! (C7 x4 (t1) oo -x4(tm)) obtained 
by componentwise multiplication. 


4See the preceding footnote. 
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We now make some general observations about functional integrals of the form 
fiom exp (5200). (2.1.117) 


In place of the position operator x(t), we can also insert other operators f(x) in 
(2.1.110). The formula becomes 


"i feols'st) = [Dat - “rosvexo( 5100). (2.1.118) 


Again, on the I.h.s., f(x) stands for an operator, on the r.h.s. for a number. 

The analogy between ordinary (Gaussian) integrals and functional integrals says 
that the finitel many ordinary degrees of freedom, the coordinate values of the inte- 
gration variable, are replaced by the infinitel many function values x(t). Therefore, 
integration by parts should yield that 


6 i 
Fe 
0= [tnx ee x (F200), (2.1.119) 
that is, the integral of a total derivative vanishes. This yields 
i y i 
j= > ftosi% : ‘exp( F400) mae: (2.1.120) 
Recalling (2.1.118), this is written as 
6L 
eg i 2g, (2.1.121) 
8x(t) 
Now, Sits) (py Tepresents the Euler-Lagrange operator (see also (2.3.8) below), and 
6L 
Oy (2.1.122) 
5x(t) 


is the classical equation of motion. Comparing (2.1.122) and (2.1.121), we see that 
the classical equation of motion is translated into an operator equation in the quan- 
tum mechanical picture. In this interpretation, x’, t’ and x”, t” represent arbitrary 
initial and fina conditions for our paths x(t). 


Returning to our integration by parts, (2.1.119) generalizes to 
i t i 5 L(x) 
= f wots exp( F209) 2? pen 


xp" ry j 
= frosty man (o7(F2) roo 


=- - fips a ‘exp( 5 La) fe. (2.1.123) 
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When none of the field (see below) contained in f(x) is evaluated at time f, the 
r.h.s. vanishes. This, of course, confirm the interpretation of (2.1.122) as a quantum 
mechanical operator equation. 

Naturally, we are now curious what happens when some of the field are present 
at time ft. So, we insert x (fo) for some t’ < fo < t”. This yields 


i [Daly 4 x(t exe( Lay) ale 
2 ; 
= [ios 7 san (P( FE) co) 
= x!t! 
“ -| [Dx]! ‘exp( Lay) at 


—_ fom exp( FL 3c =i. (2.1.124) 


As an operator equation, this is interpreted as 


— x(t0) Lx) = (t= 1) (2.1.125) 


As remarked above, Sh) i) represents the Euler-Lagrange operator (2.3.8). We con- 


sider here the example 4 ce , and the classical Euler-Lagrange equation becomes the 
operator equation 


dad 
Gai) =0. (2.1.126) 


From (2.1.114), (2.1.116), (2.1.126), (2.1.45), we obtain 


2. 


d ; 
77) T [x (t2)x(t1)] = 6(t2 — no x(t), ve) |= = —if8 (ty — t)) (2.1.127) 
using the Heisenberg commutation relation (2.1.12) for the last equation. This, of 
course, coincides with (2.1.125). In Sect. 2.5.1 below, this equation will lead us to 
the normal ordering scheme for operators. 
A recent reference on path integrals is [112]. 


2.1.4 Quasiclassical Limits 


In this section, we briefl discuss some analytical aspects of the relationship be- 
tween classical and quantum mechanics. 

In classical mechanics, stable equilibria are characterized by the principle of lo- 
cally minimal potential energy, whereas dynamical processes are described by the 
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principle of stationary motion. Both are variational principles. We consider a physi- 
cal system with d degrees of freedom x!, ..., x“. We want to determine the motion 
of the system by expressing the x! as functions of the time t. The mechanical prop- 
erties of the system are described by the kinetic and the potential energy. The kinetic 
energy is typically of the form 


d 
T= » As secgh” OR A: (2.1.128) 
igs! 


Thus, T is a function of the velocities x!,..., x4, the coordinates x!,...,x%, and 
time t; often, T does not depend explicitly on t, and one may then investigate equi- 
libria. Here, 7 is a quadratic form in the generalized velocities x!,..., x4. 

The potential energy is of the form 


V=Vcxl,...,x%,1), (2.1.129) 


that is, it does not depend on the velocities. 

In order not to have to worry about the justificatio of taking various derivatives, 
we assume that V and 7 are of class C?. 

Hamilton’s principle now postulates that motion between two points in time fo 
and ¢; occurs in such a way that the Lagrangian action 


t 
L(x) =) ‘(T— V)dt (2.1.130) 
i) 


is stationary in the class of all functions x(t) = (x! (t),...,x@(t)) with fi ed initial 
and fina states x(t) and x(t,) respectively. The Lagrangian action is the integral 
over the Lagrangian 


F(t,x,x):1=T —V, (2.1.131) 


the difference between kinetic and potential energy. 

Thus, one does not necessarily look for a minimum under all motions which carry 
the system from an initial state to a fina state, but only for a stationary value of the 
integral. For such a stationary motion, the Euler-Lagrange equations hold: 


d oT 0 


-(T—V)=0 fori=1,...,d. 2.1.132 
dias ag@\F —Y? oS 
If V and T do not explicitly depend on the time ¢, then equilibrium states are 
constant in time, that is, x' = 0 fori =1,...,d, and hence T = 0, therefore by 
(2.1.132) 
aV 
-=0 fori=1,...,d. (2.1.133) 
ox! 


Thus, in a state of equilibrium, V must have a critical point, and in order for this 
equilibrium to be stable, V must even have a minimum there. That minimum, how- 
ever, need not be unique. For example, when the state space is simply the real line 
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R, and 
V(x) = (x? — a’)? (2.1.134) 


for some a € R, the classical equilibrium states are x = ka. 

Quantum mechanically, we have an L?-function ¢ : R — C (but, for simplicity, 
we shall consider real-valued functions ¢@ in the present section) with the normal- 
ization 

lollp2 =1 (2.1.135) 
and the asymptotic behavior 


lim $x) =0. (2.1.136) 


The potential energy V (x) is now replaced by the energy 


LC 


The corresponding Euler-Lagrange equation is 


do (x) 
dx 


2 
+ VaplbO9P Jas. (2.1.137) 


Wd? 
Se AZ =0. 2.1.138 
( aa «)p (2.1.138) 
Since there is no kinetic term in (2.1.137), quantum mechanics tries to fin the 
eigenfunctions of the operator in (2.1.138), the Hamiltonian. That is, we look for 
solutions of 
ia 

(-Satve)e age (2.1.139) 
The eigenvalues £; are the energy levels. A solution ¢o for the smallest possible en- 
ergy Ey corresponds to the vacuum. E% is positive since the potential V is positive, 
see (2.1.134). The solution ¢9 (normalized by ||@o|| ;2 = 1 according to (2.1.135)) is 
symmetric, that is, do(x) = ¢o(—x), with maxima at +a, a local minimum at 0, and 
asymptotic decay limy_, +00 @(x) = 0 required by (2.1.136). The eigenfunctions for 
different eigenvalues are L?-orthogonal. In particular, the eigenfunction ¢; for the 
second smallest energy level E} satisfie [ ¢9¢1dx = 0. It is antisymmetric, that is 
(x) = —¢1(—x) and thus changes sign at x = 0. In the quasiclassical limit, that 
is, for h > 0, we have 


Eo, E, ~ ah (2.1.140) 
and 


E ~ £o~exp(-4) (2.1.141) 


for constants a, co,c,. Thus, the difference between these energy levels goes to 0 
exponentially. Therefore, in that quasiclassical limit, the energy levels of @o and ¢ 
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become indistinguishable. Thus, for i — 0, the limits of ¢4 := 5 (bo + ¢)) also 
become minima, with 


lim |p? = 8(x Fa). (2.1.142) 


g4+ and @_ break the symmetry between a and —a. Classically, any linear combi- 
nation of 6(x — a) and 6(x +a) is a possible minimum. The quantum mechanical 
vacuum, however, is symmetric. 

It is also instructive to consider a nonlinear problem. We take 


h2 
IG 
This time, ¢ need not be real-valued, but could assume values in some other space, 
like a Riemannian manifold N. We firs consider the real-valued case. The domain, 
denoted by M, however, is allowed to be of higher dimension. We suppose again 
that the potential W has two minima (W is then called a two-well potential). Now, 
in the quasiclassical limit, we do not obtain a concentration at two points, the two 
minima, in the domain, but rather the concentration at two values of @. This time, in 
contrast to the linear case, the symmetry can also be broken for fi > 0. Since this is 
not a linear problem, we no longer have the concept of eigenfunctions available. For 
h — 0, the solution becomes piecewise constant, the values being the two minima 
of W, of course. When one imposes suitable constraints, by a result of Modica [82], 
the set of discontinuity of the limit for i > 0 of the solutions for h > 0 is a hy- 
persurface of constant mean curvature in the domain. Of course, this is meaningful 
only for a higher-dimensional domain. 
When the domain is one-dimensional, that is, the real line R, but the target is of 
higher dimension, we may have quantum mechanical tunneling solutions, i.e., 


do (x) 
dx 


2 
4 Wel) Ja (2.1.143) 


lim $(x) =az (2.1.144) 


X00 


between the vacua, that is, minima of W, denoted by a;, a_. These tunneling solu- 
tions are gradient fl w lines of W when the target is a Riemannian manifold. 

There exist some generalizations of this problem that lead to analytical construc- 
tions of great interest: 


1. Let L be a real line bundle over M\S where S is a submanifold of codimension 
2 in the domain M, with prescribed holonomy for LZ around S. A section of L 
then has a zero set of codimension | in M with boundary S. Considering the 
quasiclassical limit of those zero sets for minimizers of the above functional for 
sections of L yields a minimal hypersurface in M with boundary S, according 
to [44]. 

2. Let L now be a complex line bundle over M. The above functional then leads 
to a vortex equation of the type studied by Taubes [99], Bethuel et al. [13], and 
Ding et al. [28-30]. 
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2.2 Lagrangians 
2.2.1 Lagrangian Densities for Scalars, Spinors and Vectors 


A type of particle is represented by a vector bundle E over some Lorentz mani- 
fold M. The particle transforms according to some representation of the Lorentz 
group or its double cover, the spin group.° Thus, it transforms as a tensor or as 
a spinor. The states of collections of such particles are represented by sections y 
of E, so-called fields ° 

We are considering here the semiclassical situation, i.e., before fiel quantization, 
and so w has to satisfy the Euler-Lagrange equations of some action functional that 
is invariant under the representation of the Lorentz or spin group according to which 
the particle transforms. In addition, there are internal symmetries that affect only 
the values of the fields but not of the coordinates, and leave the action invariant. In 
fact, the symmetries and certain general considerations often suffic to allow us to 
construct the appropriate Lagrangian for the action as we shall see. 


Pe — _d = 
Notation: 0,, = y77, 0% = g* "dy. 
For the moment, we can think of g,,, as a (Lorentz) metric on R!3, and the 
indices yz, v then run from 0 to 3. 


We consider the action functional (Lagrangian)’ 


1 1 
S(o) = [{Za.00"e - smo Jee) (2.2.1) 


for a free scalar field with the Lagrangian density 
1 1 4,5 
F(o, Do) = 5 oupa"g ye (2.2.2) 


on R!-3, or, more generally, on some Riemannian or Lorentzian manifold. (Note 
that in (2.2.1), we use ./—g(x)d(x) for the volume form, since we are assuming 
a Lorentzian metric. Subsequently, when we switch to the Riemannian case, the 
minus sign has to be deleted.) 

The corresponding Euler-Lagrange equation is the Klein—Gordon equation 


p+mo=0 
where CL is the Minkowski Laplacian (1.1.106). 


5In fact, according to Wigner’s principle as explained in Sect. 1.3.4, we should consider a particle 
as an irreducible unitary representation not only of the Lorentz or spin group, but of the Poincaré 
group or the double covering S/(2, C) x R!3. While this is fundamental for determining the types 
of possible elementary particles from the theory of group representations, in this section, we shall 
be mainly concerned with internal symmetries that arise from invariance w.r.t. to the action of some 
compact group. 


®A section represents a state containing possibly several particles of a given type, since in quantum 
fiel theory, particle numbers need not be preserved. 


7In the mathematical literature, an action functional is often called a Lagrange functional. 
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Remark 


1. As a classical action functional for a field we consider the action for a particle 
q(t) =(q\(t),...,q""(t)) with m degrees of freedom 


m 1 ; ; 
i >| 54 (t)gj(t) — m?q/ (tq; cy} ar. (2.2.3) 
j=l 


When we compare this with our quantum fiel theoretic setting, we see that 
the index j corresponds to the spatial variable (x!, x*, x7) above. In this sense, 
o(t, x) is a particle with infinitel many degrees of freedom, one degree of free- 
dom for each point of M. 

2. In the physics literature, the fiel @ in (2.2.1), (2.2.2) is usually taken as complex 
valued instead of real valued, that is, one considers 


1 - dl 
si(6)= [| 39.0088 — SmAiei?| Vee, (2.2.4) 


in line with the basic formalism of quantum mechanics, see (2.1.13). Our reason 
for starting with a real valued ¢ here is, besides its simplicity, that this is better 
suited for subsequent generalizations to nonlinear models where the fiel will 
take its values in a Riemannian manifold. 


We now turn to Lagrangians for spinors. 
For two left-handed spinors (see (1.3.49)) @, x, 


OX = bapb?x? 
transforms as a scalar under the spinor representation, see (1.3.56). 
Similarly 
pron,x 


transforms as a vector, for uw = 0,1, 2,3, see (1.3.57). We may then write a La- 
grangian for a left-handed spinor ¢ as 


F =Re(ido"d.6 + 2mo) 
i - = 223 
= 5 Gor dud — dnpo"h) + moo + $¢) (2.2.5) 
(here, ¢ is the complex conjugate of ¢—subsequently, we shall employ a somewhat 


different convention when we consider full spinors). 
The equation of motion for 


s@)= | F@) (2.2.6) 
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is 

idugo" —mp=0, (2.2.7) 
or equivalently 

io“ dug +md=0. (2.2.8) 


In quantum fiel theory (QFT), charged particles correspond to complex-valued 
fields and the Lagrangian has to remain invariant under multiplication of the field 
by e'*(A € R) since the phase is not observable. Instead of imposing the normal- 
ization ||¢|| = 1, we can then consider states as corresponding to lines in a Hilbert 
space. 

The preceding Lagrangian satisfie this invariance property only for m = 0. Since 
it does not have this property in general, it corresponds to a neutral fermion. In the 
standard model to be discussed below, these neutral fermions are the neutrinos. 

In order to obtain a Lagrangian for charged fermions, we need full spinors 


v= (¥): 


Then in the Weyl representation, 


vy 
transforms as a vector, see (1.3.61). 
The Dirac—Lagrangian is then 
F=iby" dy —mpy =i, Dy) —mby (2.2.9) 


(recalling the Dirac operator — define in (1.3.22)). The mass term mixes the left 

and the right spinor, since Ww = WiWr t+ WrW_. This time, we do have invariance 

under multiplication of y by e!* for constant real A also in the general case m £0. 
The corresponding Dirac equation is 


iy“ d,v —myp =0. (2.2.10) 


Perhaps the factor i in (2.2.9) in front of the Dirac operator D = yd, needs some 
explanation. The reason is that, upon integration, the corresponding term is purely 
imaginary, and the factor i then makes it real. It is instructive to consider an example, 
and since we shall mainly investigate the Riemannian in place of the Lorentzian 
setting in the sequel, we shall also use a Riemannian example. As in our treatment 
of the supersymmetric sigma model below (2.4.3), we consider the two-dimensional 
case and use the following representation of the Clifford algebra C/(2, 0): 


-1 0 0 -!il 
el ( 0 ap e—> @ 0 D (2.2.11) 


which is different from the one described in Sect. 1.3.2 (but of course equivalent to 
it). A spinor @ is thus identifie with an element (@; = a + if), #2 =a2 + if2) of 
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C?. In local coordinates x, y, then 


—-1 0\ 0 fa; 0 —-1)\ 0 (a, 
fad = os aa 
no (0 thar(e)*( 0) ay (ee) 


lea + i a6 day + i i) 


ox x dy dy 
a ee ‘i F i (2.2.12) 
ry +i ae ra i ay 
and 
c 0a 0a2 Op, dB2 0a2 0a 
2a = 
ee w Ox ms oy Bi Ox Pi dy a2 Ox 2 dy 
dB2 Opi .f, day daz Opi daz daz 
+ fa Pa, +i(6i a ay a + Bi By Bo ae 
a a a 
ini pes ig (2.2.13) 
0 ox oy 


Upon integration, the real part vanishes by integration by parts, and only the imag- 
inary part remains. This comes about because the coefficient of w commute. Were 
they to anticommute, only the real part would remain. We are making this observa- 
tion here because in our subsequent treatment of supersymmetry, we shall use spinor 
field with anticommuting coefficients 

We have now seen action functionals for scalars and spinors, where these names 
describe the transformation behavior under Lorentz transformations, i.e., coordinate 
changes. An electromagnetic field however, is described by a potential that trans- 
forms as a vector or covector. We consider 


A=Ay(x)dx". 


A is called a vector particle, because A“ transforms as a vector. Mathematically, 

A is a connection, see (1.2.12), on a vector bundle with fibe C and the Abelian 

structure group U(1) = SO(2). We also recall the transformation behavior (1.2.32). 
The fiel strength is described by the tensor 


Fuv = 0y,Ay — Ay, 


(2 times) the® curvature of the connection A, see (1.2.23) (note that the brackets 
[A,,, Avy] vanish here, because the structure group is Abelian), and the correspond- 
ing Lagrangian is the Maxwell density (the Abelian case of the Yang—Mills density) 


1 I 
= 7 Fu FH” = 5 @y Aya" AY — By Aya" A"). (2.2.14) 


8Note the different conventions between the present section and Sect. 1.2.2. 
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An important property of this Lagrangian is the gauge invariance, namely its invari- 
ance under replacing A,, by 


Ay + Oné 


where & is a scalar function. This is the present, Abelian, version of (1.2.13), 
(1.2.32). Of course, the fiel strength F,,, is already invariant under such a gauge 
transformation (see (1.2.25), (1.2.33) for the general result). 

The equations of motion for 


S(A) = -7 | for” (2.2.15) 
are 
On. FY" =0. (2.2.16) 
If we add a “mass term” 
mA, A", 


then the gauge invariance no longer holds. 

As described, the mathematical interpretation of A is that of a covariant derivative 
for sections of a line bundle, see Sect. 1.2.2. Thus, for a scalar fiel @ taking values 
in this bundle, we put 


(dad) u = InP 1 Aud, 


and we may consider the interaction Lagrangian 


1 1 
5 nb + Aug) 0" G* + (AMO)*) = FIIdagll”. (2.2.17) 


Here, we assume that the line bundle is Hermitian, and for simplicity, we write the 
metric as ||||7 = @*; of course, in general this only holds in suitable coordinates; 
we also assume that A is unitary w.r.t. this metric—we shall return to this point in 
a moment. 

The replacement of 


dnd with ,6+ Ayo (2.2.18) 


is for the following reason. The Lagrangian 
1 1 4 
7 nga" a 5 og" 
is invariant under U(1), i.e., under replacements 
gre’ withd eR. 


It thus has a global internal symmetry. It is not invariant, however, under general 
local symmetries, i.e., 


rw) es ely 
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if #(x) is a nontrivial function of x. However, if, according to (1.2.14), we also 
replace” 

Aut Ay — 10,0, 


then the above interaction Lagrangian (2.2.17) remains invariant. Thus, we have 
a gauge invariant Lagrangian. The procedure (2.2.18) of replacing an ordinary by 
a covariant derivative is called minimal coupling. 

In fact, we have a free parameter here: We consider the exterior derivative d as 
the trivial connection (“vacuum”) on the trivial bundle M x R. We can then view 
the affin space of connections A as the vector space Q!(M) of 1-forms. We can 
therefore multiply A by some factor g and choose the covariant derivative 


Da:=0+qA (2.2.19) 
and gauge transform A to 
A= a9. (2.2.20) 
q 


q here is interpreted as the charge of the electromagnetic field It is the Noether 
charge associated to the U(1) gauge symmetry. 

The full Lagrangian for a complex scalar fiel @ interacting with an electromag- 
netic fiel A is 


1 1 1 
=||d,o\|? — —m? |||? + —<|| FI’. 2251 
5 AQ || 5m Il\ +352 Il Fl ( ) 


The same discussion applies to spinor field w, and we may form the interaction 
Lagrangian 


“ a 1 
ivy" (a, + Ay —mby + gga (2.2.22) 


Let us see the details once more: Replacing w(x) by e!”) w(x) changes the spinor 
Lagrangian 


ivy" dw —mypw (2.2.23) 


by — Wy" vay, and this is again compensated when we replace 0, by 0, + GAu 
and require that A transforms to A — 00 as before. Thus, 


ivy" (dO, + Aww —mbw (2.2.24) 


remains gauge invariant. 


°Note that the convention here is different from the one in Sect. 1.2.3; here, elements of the Lie 
algebra u(1) of U(1), and similarly of other Lie algebras g, are written as iv, with a real 3. This 
will lead to various factors i and —1 when compared to Sect. 1.2.3. This is the standard convention 
employed in the physics literature. 
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As was realized by Yang and Mills,!° this can be generalized to an arbitrary in- 
ternal symmetry group G with Lie algebra g, and a fiel @ that takes its value in 
a vector bundle (or, similarly, in a spinor bundle—the physically more important 
case, see Sect. 2.2.3 below)!! with structure group G. The mathematical formalism 
for this has been described in Sect. 1.2.3. In abstract physical terms, the gauge prin- 
ciple says that the symmetries should determine the forces. The particles conveying 
these forces are called gauge bosons. 

To implement this, we simply consider A = A,,dx", a 1-form with values in 
g, and form the covariant derivative (1.2.12) of the fiel  , a section of the vector 
bundle with structure group G on which A operates as a covariant derivative, 


dag =do+ Ad. 
The replacements 
o(x) g(x)o(x), with g(x) € G for all x, 
i.e., g is an element of the group of gauge transformations, and (1.2.32), that is, 
At> gAg ! — (ag)g7! 


then leave 


IIda@ll? 


invariant (assuming of course that the metric || - || is G-invariant). 
The gauge fiel strength is now (two times) the curvature (1.2.22), (1.2.23) 


Fuv = Oy Ay — 0yAy + [Ap, Av] 
where [., -] is the Lie algebra bracket of g. 


As before, we may form the Lagrangian involving the Yang—Mills action (1.2.34) 
and coupling it with the action for the fiel @ 


1 1 1 
sllaa@ll? - sm lig” - ig Fyy FY’. (2.2.25) 


The same discussion applies to spinor field w with values in a vector bundle on 
which G acts, that is, sections of S ® E for some vector bundle E over M witha 
G-action, and we may form the interaction Lagrangian 


“ i 1 
ivy" (Oy + Ap)y —mypy — 4,2 Fisk; (2.2.26) 
q 


'0Such ideas were firs conceived by Hermann Weyl. 


'1Of course, a spinor bundle is a vector bundle, but in physics, it is important to distinguish between 
vector and spinor representations, that is, whether a representation of the spin group lifts to one of 
the orthogonal group or not. 
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If the representation of G on our vector bundle is not irreducible, but decomposes 
into subrepresentations indexed by j, we may use a more general Lagrangian for the 
gauge fiel strength because we can take a combination 


= So yj Te FD OM, 
j 


The y; are the so-called coupling constants. Mathematically, they parametrize the 
ad-invariant bilinear forms on the Lie algebra of G. 


Important remark: It is undesirable to have too many constants whose values 
are not theoretically deduced, but can only be experimentally determined. As we 
have just seen, such a situation comes about if the representation of G under consid- 
eration is not irreducible. One possible solution of this problem would be to suppose 
that there is some larger group G D G in the background with an irreducible rep- 
resentation that induces the (reducible) representation of G, and so determines all 
the constants except one. It may be possible that the symmetry group G cannot be 
experimentally observed because of a symmetry-breaking mechanism that reduces 
G to G. The Higgs mechanism, to be described below, is such a mechanism. 


Let us recapitulate that A is a 1-form with values in g, and the symmetries there- 
fore are two-fold: Under Lorentz or space-time symmetries, the 1-form part is trans- 
formed, whereas under local internal symmetries (i.e., those coming from G), the 
g part is affected. Similarly, @ transforms as a scalar under space-time symmetries 
and by the action of G under local internal symmetries. 

More generally, one may also introduce some nonlinearities into the @ and y 
equations by adding some polynomial terms to the Lagrangian. These polynomial 
terms, however, are constrained by the requirement of renormalizability. In dimen- 
sion 4, the most general renormalizable Lagrangian is 


1 1 1 2 
~Gge TE Fav FM + Slidell? — melo" + ivy" Gp Aww 

+21 |o|4 + go¢ww + lower-order terms. (2.2.27) 
Here, g1, g2, like q, are coupling constants. The term $y is called a Yukawa term. 
A version of (2.2.27) is also the Lagrangian of the standard model to which we shall 


turn after a brief discussion of the scaling behavior of Lagrangians. The gauge group 
of the standard model is SU(3) x SU(2) x U(1). 


2.2.2 Scaling 


An important criterion for fiel theories in physics and variational problems in math- 
ematics is their scaling behavior. That means that one scales the independent vari- 
ables 


xo teysy (2.2.28) 
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where x is n-dimensional, x € R”, and 4 > 0, and computes the resulting scaling 
behavior of the integrals of the fields The starting point is the scaling of the volume 
form 


d"y=i "dx. (2.2.29) 
Also, 
0 0 
dy = tes (2.2.30) 
Putting ¢)(y) := @(y) = d(x), one obtains 
[. ldgox(y)|Pd"y =A? [. |dd(x)|Pd"x (2.2.31) 
and 
/ lba(y) 4d" y =A4 [ p(x) |4d"x (2.2.32) 
R’ Ra 


for exponents p,q > 0. Therefore, the L7-norm of ¢,, a lpr (y)|4d" y)!/4 has 
a scaling behavior dominated by the L?-norm of the derivative 
di, Cfign |dga(y)|Pa"y)!/?, if 


q< for p <n. (2.2.33) 


n— p 
This is exploited in the Sobolev embedding theorem, see e.g. [63]. For instance, in 
the most important case p = 2, in dimension 2, any polynomial in ¢ is controlled by 
f |\d@|, in dimension 3, a polynomial of order < 6, and in dimension 4, only those 
of order < 4. 
In the light of (2.2.29), (2.2.30), the scaling law (2.2.31) can also be interpreted 
in the way that = |d@|* remains invariant if the fiel is scaled as 


oro. (2.2.34) 


In particular, [ |d@|* is scaling invariant in dimension 2; in fact, this integral is 
even conformally invariant in dimension 2, as we shall explore below. In other di- 
mensions, it becomes invariant only after a rescaling of the fiel @ according to 
(2.2.34). In order to compensate for the different scaling laws for fi |\do|?d"x and 
te |p(x)|?d"x, one may also introduce coupling constants that are then scaled ap- 
propriately. To see this, let us consider 


[ (Idol? — m*|(a)P + g11O@)P + galoOl*)a"x; (2.2.35) 
here, m is the mass of the fiel as before. In order that this integral be scaling invari- 
ant, because of (2.2.29), each such polynomial term has to scale with a factor A”. If 


now ¢ scales according to (2.2.34), this leads to 


m— Am, SS sy ieee pe, (2.2.36) 
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Of course, this just re-expresses our discussion of (2.2.32), (2.2.33). For instance, 
in dimension < 4, the integral of the polynomial ¢* is controlled by that of |d@|’, 
and therefore, we can afford a nonnegative scaling exponent for the coupling con- 
stant g. In general, an interaction term is called perturbatively renormalizable when 
the coupling constant scales with exponent 0, and superrenormalizable when it 
scales with a positive exponent. 

Similarly, when we consider a term 


[ _vigltidgl?a"s, (2.2.37) 


we see that for g > 0, the coupling constant y scales with exponent G-ng which 
is negative for dimension > 2. Thus, such an interaction is renormalizable only for 
n = 2, but not for n > 2. This applies to the nonlinear sigma model discussed below 
(see (2.4.27), (2.4.28)), 


I ij (P(X) on ax v2) 
Re Xx Ox 


where g;; denotes the metric tensor of the target N. When we expand (in normal 
coordinates) 


gij(b) = Si + sie! + higher-order terms, (2.2.39) 


we see that this model is not renormalizable for n > 2. 
Next, if we have a Dirac term for a spinor fiel as in (2.2.9) in the Lagrangian, 
that is, if we have an action of the form 


[ i(w, Dw)d"x, (2.2.40) 


we need the scaling behavior 
porty (2.2.41) 
to make it invariant. When we then have a mass term 
my, (2.2.42) 


we obtain once more the scaling law m > Am as in (2.2.36). 
We finall consider gauge field as in (2.2.27), 


with curvature F = qdA + q?A A A, by (1.2.22). We can then expand the La- 
grangian action (2.2.21) (in shorthand notation, as we are only interested in the 
growth orders), 


1 
— | ||Fld"x= if (dA)? + gA*dA+q7A*)a"x. (2.2.44) 
4q Re 4 Rn 
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As above, from the firs term we get the scaling law (2.2.34) 

A> iM A, (2.2.45) 
which then, taking always (2.2.29) into account, yields 


g>i Fa. (2.2.46) 


Thus, a gauge fiel Lagrangian action is renormalizable for dimension 4, but not 
above. The reader will then check that the same applies to the full interaction La- 
grangians (2.2.21) and (2.2.26). Thus, we see that the Lagrangian action define by 
(2.2.27) is indeed perturbatively renormalizable in dimension 4. 

In contrast to this, let us consider the Einstein—Hilbert functional (1.1.163) of 
general relativity, 


1 
——— R(g)d" 2.2.47 
— [ _R(g)d"x (2.2.47) 


for the scalar curvature R of the metric g. Here, we have introduced the factor 
im that we had neglected in the discussion of (1.1.163) above, G being Newton’s 
gravitational constant. Also, we write d”x for the volume form because we expand 
around the fla metric, 


gij = bij + vijVG. (2.2.48) 


We then obtain, with a similar shorthand notation as above, for the Einstein—Hilbert 
action (2.2.47) 


1 
a u ((dh)? + /Gh(dh)* + higher-order terms) d"x, (2.2.49) 
IT JiR. 


whence the scaling behavior h > rh as before, and then 
Gai "G6, (2.2.50) 


Thus, the Lagrangian action of general relativity is renormalizable only in dimen- 
sion 2, but not in dimension 4. This indicates that there should be difficultie in 
unifying gravity in dimension 4 with the other forces that are governed by a renor- 
malizable Lagrangian of the form (2.2.27). 

Here, we have only discussed perturbative renormalization (using [106]), but not 
nonperturbative renormalization, which is a more difficul issue. Some references 
for renormalization theory are [52, 113]. 


2.2.3 Elementary Particle Physics and the Standard Model 


We now interrupt the process of setting up mathematical structures to discuss how 
this relates to elementary particles and in particular to their the contemporary theory 
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as incorporated in the so-called standard model and its extensions. Physicists will, 
of course, know all this and may skip this section. 

There exist four known basic physical forces: the electromagnetic, weak and 
strong forces and gravity. The standard model includes the firs three of them, but 
leaves out gravity. In fact, it is the fundamental challenge of high-energy theoretical 
physics to construct a unifie theory of all known forces, including gravity. 

In any case, in a relativistic theory of elementary particles without gravitational 
effects, the Lagrangian should be invariant under the action of the Poincaré group 
or the double cover G := SI(2, C) x R!3, see Sect. 1.3.4. This was most clearly for- 
mulated by Wigner who identifie an elementary particle with an irreducible unitary 
representation of G satisfying certain physical restrictions, like m? > 0, where m is 
the mass. This principle is still fundamental, with the modificatio that one needs to 
consider groups that are larger than G, in order to account for internal symmetries of 
the particles beyond the spin. The principal for identifying that group combines the 
mathematical theory of group representations with scattering experiments designed 
to break the symmetry. To take an example from quantum mechanics, the Hamil- 
tonian of a particle in a rotationally symmetric potential, H = zo + V(|x|) (see 
(2.1.9)), commutes with the angular momentum operator x x p = Aw x V), and 
therefore its eigenvalues, the energy levels, are degenerate. When an external mag- 
netic fiel is applied, this symmetry gets broken and the energy levels, that is, the 
eigenvalues of the Hamiltonian, become distinct (Stern—Gerlach experiment, Zee- 
man effect). A further splitting of the energy levels of an electron in the presence of 
an external fiel is caused by its spin. 

When it became clear that the proton and the neutron were very similar, except 
for their electrical charge, Heisenberg suggested that there was a single underlying 
particle, the nucleon, with a so-called isotopic spin, for short isospin, symmetry that 
was broken in the presence of electromagnetic interactions. This should correspond 
to the L = 5 representation of SU(2), as described at the end of Sect. 1.3.4. The 
proton and the neutron should correspond to the eigenvalues 5 and -5 of h = —if3. 
The subsequently discovered pions 2+, 2°, 2~ should likewise correspond to the 
representation for L = 1 with the eigenvalues 1,0,—1 of h. This was supported 
by pion—nucleon scattering experiments. In those scattering experiments, the total 
charge Q as well as the total baryon number B was conserved (proton and neuron 
have B = 1, the pions have B = 0). In order to also incorporate the decay properties 
of other particles, Gell-Mann and Nishijima introduced another quantum number S, 
called strangeness, to be also preserved. There is a fundamental relation between 
the preceding numbers 


O=h+5(B+S). (2.2.51) 


Gell-Mann and Ne’eman then interpreted this as a consequence of the embedding 
of the isospin and the “hypercharge” B+ S into the larger Lie group SU(3) of 
“fl vor” symmetry, and this led Gell-Mann to suggest the existence of “quarks”, 
particles corresponding to the basic representation of SU(3) on C?. Finally, “color”, 
another internal SU(3) degree of freedom, was proposed. The modern theory of the 
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strong interaction was then called quantum chromodynamics, after the Greek word 
for color. 

We now turn to the unificatio of the fundamental forces. Electromagnetic and 
weak interactions (responsible for certain decay processes, like the beta decay of 
neutrons in nuclei) had been unifie earlier, in 1967, in the so-called electroweak 
theory developed by Glashow, Salam and Weinberg, and in the early 1970s, the 
standard model combined this theory with quantum chromodynamics, the theory 
of strong interactions between quarks developed by Gell-Mann, Zweig and others. 
There are two types of particles in the theory: the fermions, which represent matter, 
and the bosons, which transmit forces between the fermions. Fermions have half- 
integer spin and satisfy the Pauli exclusion principle which states that two fermions 
cannot be in identical quantum states. This aspect will subsequently be incorporated 
into the formal framework by letting the fermions be odd Grassmann-valued, that is, 
anticommuting. Bosons have integer spins and do not have to satisfy the Pauli prin- 
ciple, and they will therefore be even Grassmann-valued, that is, commuting. The 
Lagrangian then has to couple the fermions and the bosons. There are four cate- 
gories of bosons in the model: the photon that mediates electromagnetic interaction, 
the W~ and Z boson for the weak force, eight types of gluons for the strong nu- 
clear force, and finall the Higgs boson that induces spontaneous symmetry break- 
ing of the gauge group for the electroweak interactions by a mechanism described 
in Sect. 2.2.4 below and that thereby provides masses to particles. While all the 
other bosons have been experimentally confirmed with several of them predicted 
by the theory before their experimental observation, the Higgs boson has not yet 
been detected, but it may be detected soon with more powerful particle accelerators, 
because it should be seen at the energy scale where the unificatio between the elec- 
tromagnetic and weak forces takes place (this is about 10! electron volts, or about 
10~!° times the Planck scale). Except for the Higgs boson and the W and Z bosons 
(which, in contrast to the photon, are massive, by the Higgs mechanism), the bosons 
are gauge particles, meaning that their contribution to the Lagrangian is invariant 
under gauge transformations from some internal symmetry group, as described in 
Sect. 2.2.1. The Lagrangians for bosons are of Yang—Mills type, as described in 
Sect. 1.2.3. The gauge group of electromagnetism is the Abelian group U (1), as al- 
ready explained. For the electroweak interaction, the gauge group is SU(2) x U(1). 
We should note, however, that the SU(2) here is not the symmetry group of the 
weak interaction, which is not a gauge theory anyway. In fact, below the energy 
for the unificatio of the weak and the electromagnetic interactions, SU(2) does 
not represent any symmetries. The gauge group U(1) of electromagnetism is not 
the U(1)-factor in SU(2) x U(1), but rather a combination of the U(1)-subgroup 
of SU(2) and the U(1)-factor in SU(2) x U(1) (Weinberg angle). For the strong 
interaction, the gauge group is SU(3). The gauge group of the standard model is 
therefore the product SU(3) x SU(2) x U(1). A class of extensions of the standard 
model, the so-called grand unifie theories (GUTs), postulate that these groups are 
subgroups of a single large symmetry group, for example SU(5). The grand unifie 
theories have the advantage that they reduce the number of free parameters in the 
theory, see the remark at the end of Sect. 2.2.1. This symmetry, however, is only 
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present at very high energies, but is reduced to SU(3) x SU(2) x U(1) at lower en- 
ergies (including those achievable by current particle accelerators) by a process of 
spontaneous symmetry breaking, see Sect. 2.2.4 below. Most of these grand unifie 
theories, including SU(5), had to be given up because (in contrast to the standard 
model) they predicted proton decay at a rate not observed in nature. 

Supersymmetry, to be discussed below, postulates an additional symmetry be- 
tween bosonic and fermionic particles. 

For the fermions, we have 12 different types (“fl vors”, each representing a par- 
ticle and its antiparticle) in the standard model. That fl vor is changed by the weak 
interaction, mediated by the heavy W and Z gauge bosons. The fermions come in 
two classes, leptons (including the electron and the electron neutrino) and quarks. 
Only the latter ones participate in strong interactions, by a property called “color”, 
as already mentioned above. Since, in contrast to the electroweak forces, the strong 
force grows with the distance between quarks, they become confine in hadrons, 
colorless combinations. These can consist either of three quarks, like the protons 
and neutrons, and therefore be fermionic (baryons), or of a quark—antiquark pair, 
and then be bosonic (mesons), like the pions. In particular, these particles, protons, 
neutrons, pions and so on, are not elementary, but composite. The fermions are also 
classifie into three generations. Each fermion in one generation has counterparts in 
the other generations that only differ in their masses, for example the electron, the 
muon and the tau lepton. Ordinary matter consists of fermions of the firs genera- 
tion, that is, the electron, the electron neutrino and the up and down quarks, as the 
ones in the other generations are substantially more massive and quickly decay into 
lower-generation ones. 

While the standard model is well confirme (with some revision to account for 
the experimentally observed neutrino masses that had not been predicted by the orig- 
inal model), renormalizable and generally accepted, it cannot yet be the ultimate an- 
swer because it does not include gravity and does not fare well at the cosmological 
level. Also, it is not entirely satisfactory because it contains too many free parame- 
ters that are not theoretically derived, but can only be experimentally determined. 
(As mentioned, the number of these free parameters is reduced in the grand unifie 
theories.) 

The concepts behind the standard model, however, are theoretically very appeal- 
ing. When extended by the more recent ideas of superstring theory, they may well 
lead to a general theory of all known physical forces, at least according to the present 
opinion of many, if not most, theoretical physicists. !* 

In the present book, we are not concerned with the detailed physical aspects of 
the standard model, but only with the underlying mathematical concepts. Therefore, 
we shall essentially only treat a toy model, the so-called sigma model, that itself does 
not pretend to describe actual physics, but which on the one hand exhibits many of 
the conceptual issues in a particularly transparent manner, and on the other hand 


! As opinions in theoretical physics can change rapidly, this statement may no longer be up to date 
when this book goes to print, and perhaps not even at the time of writing. There seems to be at 
least a tendency towards growing scepticism with regard to the prospects of superstring theory. 
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constitutes the starting point for string theory which, in contrast, aims at physically 
valid predictions. 


2.2.4 The Higgs Mechanism 


For a d-component scalar fiel_ @ = (¢!,..., 7), we may consider the Lagrangian 
1 iu i ges j i 
F= 5 ou agi — fi oo; (w.log. a = aj). 


(Since the metric 6;; on d-dimensional Euclidean space is flat we can freely move 
indices up and down to conform to the usual summation conventions.) 

By diagonalizing the quadratic form a! do ; by an orthogonal transformation— 
which leaves 0,,6'0"@; invariant—we can bring F into the form 


1 . 1 : 
F= 7 Ou! Gi =5 Suid oi 
; 


(the 2; are the eigenvalues of the symmetric matrix (q;;)). 


If all j4; = 0, this Lagrangian describes d scalar particles of masses m; = ,/;. Such 
an interpretation is no longer possible for negative j1;. 

More generally, for a multicomponent scalar field we may consider the La- 
grangian 


1 bay 
FS 55? Oi — VO) 


where the potential V(#) incorporates self-interactions. Typically, V contains 
a quadratic term a; i / and a higher-order term. 

The classical vacuum corresponds to the minimum of V. The problem of sym- 
metry breaking arises, namely that while V itself is invariant, the vacuum may not 
be invariant under the full symmetry group G of F’. In that case, the vacuum consists 
of a whole G orbit, 1.¢., is degenerate. 

Let us consider a simple example: ¢ is a real scalar field 


1 Xv 
Vig) = sud? + 70". 


In order to make V bounded from below, we assume A. > 0. If > 0, then ¢ = 0 is 
the vacuum, and the term gt may simply be treated as a higher-order perturbation 
for the Lagrangian 


1 1 
iy= 5 nha" = sme (m= /n) 
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for a scalar particle of mass m. If 4 < 0, the situation changes, and this interpretation 
is no longer possible. The vacuum is now located at 


go =v =+,/ —. 


In order to make a perturbation around the vacuum, one now has to consider the 
shifted fiel 


d = o) —0v, 
which breaks the symmetry between ¢ and —@. In terms of ¢, the Lagrangian be- 


comes 


1, gaz, 1 30 73 Avs ‘ 
F= 7 oud d"d + zhe Aud ae (+-an irrelevant constant term). 


Since jz is negative, we can interpret @ as a scalar particle of mass m = J. 

We now apply a similar analysis for a massive complex scalar particle coupled 
with a gauge fiel (i.e., a massless vector particle) and consider the Lagrangian (cf. 
(2.2.1), (2.2.14), (2.2.17)) 


F = 5 (nb + Ap \(O"O" A“ p*) — (|p? — 07)? afr (t > 0). 
(2.2.52) 
(The minus sign in front of A” arises because A” is in u(1) = iR, and we have to 
take the complex conjugate (A“¢)* = —A"¢*.) 
For A > 0, the vacuum now is at 


|p| =, 


p= rel”, 


Thus, the vacuum is a nontrivial U(1) orbit, ie., degenerate. We therefore impose 
an additional gauge condition 


Ing =0, Red > 0 
which uniquely locates the vacuum at 
p=t. 


Again, we want to expand around the vacuum and put 


g=p-t. 
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The Lagrangian becomes (up to a constant term) 


| Pee Beate 1 - 1 

F = 5d,$9"6 Ab? ( +21)? + 5 Au ANG + t)? — ae 

= 1, poh — 4Ar7g? + Le Ae pee + higher-order t 

= 59 xt Au gen Fw igher-order terms. 
Thus, up to these higher-order terms, F describes a scalar particle @ of mass m = 
2t/ 2A and a vector particle A of mass t. Because of our above gauge condition, 
the scalar particle now has only one real degree of freedom left; the other degree of 
freedom has been gauged into the vector particle that has acquired a mass. 

Alternatively, we write 


gaelt(r+£)=1+E+int+ OF? +77) 


and consider 


gé=e ito —t, 
A’ =A wr) 
pw 4p Tou 
Then F becomes 
1 7 1 DAE AE 2-2 1 pv 
P= 5059 é&-— at A,A —4At é = ger tuk 


+ higher-order terms in & and A’, 


but 7 has disappeared, gauged into the vector particle A’ that has acquired a mass. 
Let us discuss the Higgs mechanism in more generality. Again, we start with 
a simple scalar fiel and a Lagrangian 


1 
F@)= 5 Ouparg —V@). 


We assume that ¢ takes values in a vector bundle with structure group G, and that 
V is G-invariant. 
Let v be a classical vacuum, i.e., a minimizer of the potential V. Then 


dV ae 
ag = atg@=v. 
We assume that the symmetry group G is broken in the sense that v is only invariant 
under a smaller group H C G, but not under all of G. 
We choose generators o!,..., 0% of the Lie algebra g of G in such a manner 
that 3!,..., 0” generate the Lie algebra h of H. Thus 


owv=0 fori=l,...,M, 


vv#0 forj=M+l,...,N. 
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Since V is G-invariant, the derivative of V in the directions tangent to the G-orbit 
of @ vanishes, that is, 


“go LV (CRP) B)ay =O for all j. 


Differentiating this relation w.r.t. @ yields 


a’Vv dV 
Krlb+ 8 = 0 forall j. 
d¢? do 

Atd=v, a = 0, and hence 


d*V(v) 


Te v/v =0. 


Since for j = M+1,...,N,0/v 40, #v is an eigenvector of the Hessian oe 
with zero eigenvalue. Since this Hessian gives the quadratic term in the expansion 
of our Lagrangian F(@) at the vacuum ¢ = v, and since the eigenvalues of this 
quadratic form are interpreted as squared masses, we interpret exp(0/)v for j = 
M+1,...,N asa massless boson. 

Thus, for each broken generator of the symmetry group, we have found a so- 
called massless Goldstone boson. To get the Higgs mechanism, we introduce 
a gauge fiel A, with values in g, that couples to our scalar fiel @ and consider 
the Lagrangian 


1 oe 
Fp, A)=—3 FF + 5 Op + Aug)(O" + AN) — VO), 


with a G-invariant potential V as before. 
Again, we assume that the vacuum v is only H-invariant. We expand 


M N 
g=vt > go + Yo nydlvt+ O€7 +1’) 


i=] j=M+1 


boo 2 orzo") 


With this expansion and with the notation A,, = A; iO! , we obtain the term 
Tro! vd; vAj,Al# 


in F(@, A), which we interpret as the mass term for the vector fields The gauge 
change 


Aju = Aju — 9uNj; 
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together with the fact that 0/v for j = M+1,...,N is a 0-eigenvector of the 
Hessian of V at v, makes the / disappear in the expansion of F(@, A) up to second- 
order. Thus, we obtain N — M massive vector field that have absorbed the N — M 
Goldstone bosons. 

The idea of a gauge theoretic interpretation of spontaneous symmetry breaking 
was conceived independently by Englert and Brout and by Higgs in the early 1960s. 


2.2.5 Supersymmetric Point Particles 


We now take a step backwards and consider a one-dimensional domain. In geomet- 
ric terms, the aim of this section is to derive a supersymmetric version of the action 
functional (1.1.119) for geodesics, discussed in Sect. 1.1.4. In physical terms, we 
want to introduce Lagrangians that exhibit a symmetry between bosonic and fermi- 
onic fields The supersymmetric point particle is the simplest instance of this. 

We start with the Euclidean case. We consider (t,@) € R'!! as coordinates (see 
Sect. 1.5.2), as well as scalar superfield 


X(t, 0) =O (th +W7(0, a=l,...,d, (2.2.53) 


with o“ even and y@ odd.!> Our Lagrangian is 


1... : 
Ly)= 5 P'Pa + wi Wa) (2.2.54) 
and the action is 
1 ae : 
i= 5 [6% + 0 Gada (2.2.55) 


Here, t and 6 are the independent variables, @ and yw the dependent ones. ¢ is called 
a bosonic field or boson for short, w a fermionic one or fermion. 

Thus, both the arguments and the values of X are Grassmannian. Here, this makes 
X even. We also note that it is important that y be anticommuting in (2.2.55), as 
otherwise an integration by parts would imply that f 4 Wa dt vanishes identically. 


Remark 


1. In the physics literature, one usually puts a factor i in front of the term wy“, 
in the Lagrangian F/, and likewise in the other supersymmetric Lagrangians we 
shall treat here. Since the expression is Grassmann valued, this becomes a matter 
of convention, in contrast to the real-valued situation of (2.2.12). The convention 
with the i is compatible with the following convention usually adopted in the 


As w and @ are both odd, they anticommute. Otherwise, at this point, they have nothing to do 
with each other. 
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physics literature for definin a complex conjugation * on Grassmann variables. 
This conjugation should satisfy 


(1+) =i 4+, (t1T)* = tT}. (2.2.56) 


One define the complex conjugate of an ordinary complex number as the ordi- 
nary complex conjugate, and one assumes that the generators 7,..., Uy of the 
Grassmann algebra are real, i.e., 


o* =H. (2.2.57) 


Ll 
Thus 
Cuba Vay = Top Ven tay: (2.2.58) 


The elements of the Grassmann algebra are called supernumbers. A supernum- 
ber t is then called real if t* = t, imaginary if r* = —t. Thus, vy, --- Bo, is 
real if 5k(k — 1) is even, imaginary if 5k(k — 1) is odd. With this convention, 
the term y“y“, being the product of two real odd quantities, is purely imagi- 
nary, and the factor i then serves to make it real. In any case, a factor i in the 
Lagrangian in front of the w term would then require also a compensating factor 
in the supersymmetry transformations (2.2.68) below. 

2. We may, in fact, put any factor « in front of the term wy“ in the Lagrangian 
F{ (and likewise, we may put factors in front of other terms we shall add to our 
Lagrangians to make them supersymmetric). We then simply need to compensate 
for this in our variations (2.2.68) below, for example by inserting a factor 1/k 
into the right-hand side of the variation for yw. With such a factor «, we can then 
perform expansions of the Lagrangian and other quantities in terms of « which 
is a useful device often seen in physics texts. 


The Euler-Lagrange equations for L; are 


4 =0, (2.2.59) 


y* =0, (2.2.60) 


and L, describes a free superpoint particle. a is a vector index, and the setting 
can be generalized to particles moving on a Riemannian manifold M with metric 
gab(y)dy? ® dy”. #(t) then transforms as a tangent vector to M, and one postulates 
that y(t) likewise transforms as a tangent vector in Ty(;)M, i.e., under a change of 
coordinates y = f(y’) in M, one has 
a oy" 'b : 2 3 
y= yb w’  (w is thus a vector with Grassmann coefficients) (2.2.61) 
y 

(Note that here we are transforming the coordinates in the image; anticipating the 
discussion below, this is perfectly compatible with y being a spinor field because 
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this refers to the transformation behavior w.r.t. the independent variables.) Thus, w 
is an odd vector fiel along the map @. In particular, the scalar product 


(b. Vv) = gan(o)o*w? (2.2.62) 
is invariant under coordinate transformations on M. We may use the Lagrangian 
1 er oe : ; 
La i= 58an($)"o" + > san b)w*V aw’, (2.2.63) 
with 
d F 
Va = Tw + babe (2.2.64) 
The Euler-Lagrange equations for 
S2= / Ly, p)dt (2.2.65) 
are 
ha 1 a jbyciyd 
Va@ = 7 Roca? wows =0, (2.2.66) 
Va ww? =0. (2.2.67) 


In contrast to (2.2.59), (2.2.60), these fiel equations couple ¢ and yw. Equation 
(2.2.66) is the supersymmetric generalization of the geodesic equation V a x“ = 0, 
ef. (1.1.124), (1.1.125). When we consider y as an ordinary field the equations 
(2.2.66), (2.2.67) describe a spinning particle in a gravitational field Unless w = 0, 
the particle no longer moves along a geodesic, because of the presence of the second 
term in the firs equation. 

We return to the action S;, and we perform the variations 


66% =—-ew", 
(2.2.68) 
by" = eg" 


with an odd parameter ¢. The variation of the Lagrangian L of S; is 


es 1 F 1 F 
6Ly = dha [ 5 OV Va a 7 ova 


= —6$ Yat 566 Na + Ueda 


eee ee eee 
=—ée Wa + 580 Wat 5a weda =o Eda 


= -25 (5¥"4e] (using W“e = —ew", as e and wy“ are odd). (2.2.69) 
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Since dL is a total derivative, it follows that S; is invariant under the variation 
(2.2.68). The point of the superspace formalism is now that this variation is induced 
by a variation of the independent variables t, 0. Namely, we consider the so-called 
supersymmetry generators 


Q:=T :=00; + 00, (2.2.70) 
P:=4. (2.2.71) 


Here, Q and P are the notations usually employed in physics texts. The operator 
Q =tT should be compared with the operator D := 0g — 00; introduced in (1.5.34) 
in Sect. 1.5.2. Then 


EQOX"(t,0) =e OO" (t) + W()8) = eb"6 — ew" 
(since 67 = 0 and 6 commutes with $4, but anticommutes with yw“), (2.2.72) 
which yields (2.2.68). We have 


[O, 0] =207 = 2(60; + 99)(09; + 89) = 28, =2P 


(since, e.g., 9 and dg anticommute). (2.2.73) 
Similarly 
[D, D] = —20; = —2P, (2.2.74) 
[O, P] = (00; + 09); — 0;(00; + dg) = 0 
(since dg anticommutes with 6 and commutes with 9,), (2.2.75) 
[D, P]=0, (2.2.76) 
[P, P]=0. (2.2.77) 


Equations (2.2.73), (2.2.75), (2.2.77) mean that Q and P generate a super Lie al- 
gebra, see (1.5.5), 1.e., a mod 2 graded vector space S over C, endowed with a su- 
perbracket [-,-] that is bilinear, mod 2 graded additive and superanticommutative, 
1.e., 


[A,B] =[B, A] if A, B both are of odd degree, 
[A,B]=-—[B, A] otherwise, i.e., if A or B is even 


and that satisfie the super-Jacobi identity (1.5.6), 
(—1)“TA, [B, C]] + (-D)"1B, [C, Al] + (-1)IC, [A, BI] =0, 
where a, b, c are the degrees of A, B and C, resp. Returning to (2.2.69), we have 


él, =-el 
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with 


We also have 
1 pa pa 1 a.j.a 
ol ae d eeu we =eLy. 


These variations for L; and / are quite similar to the ones for ¢“ and yy“, compare 
(2.2.68), except that the roles of bosons and fermions have been exchanged. In the 
physics literature, the representation of the supersymmetry algebra on the $%, yy“ 
space is called a “bosonic multiplet”, whereas the one on the L,,/ space is called 
a “fermionic multiplet”. 

We are now going to consider a functional on R!!? with coordinates (t, 6!, 67), 
and the supersymmetry generators 


Ou = 09; + dee, (2.2.78) 
P=a; (2.2.79) 


They span a super Lie algebra with 


[Qu, Op] = 2dag P, 


(2.2.80) 
[Qa, P]=0=[P, PI. 
We try a superfiel 
X(t, 0%) = 94 (1) + WA (6. (2.2.81) 
We have 
6Q1X%(t,0', 07) = —ew? + 290! — efo'6?, (2.2.82) 


and similarly for ¢Q>. This is different from the previous situation as we now also 
get a 0'6? term that can neither be considered as a variation of #“ nor as a variation 
of Ww“. This problem stems from the fact that we now have only one bosonic fiel @, 
but two fermionic field yy), wy. Since supersymmetry mixes bosonic and fermionic 
fields we need to introduce an additional bosonic fiel and consider the superfiel 


¥47(t,0%) = $7 (t) + WS (0% + F2 (10167 (2.2.83) 
with an even F“. We now get 


eQiY(t,0%) =—ew? + 6670! + eF 40? — efo'6?. (2.2.84) 
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Thus, we get the variations 


6p% =—-epy, 
byt = ef", 
(2.2.85) 
és =eF*, 
6F4= —e 5 
for Q; and analogous variations for Qo. 
We consider the action 
1 2. : 
83= | 56%ba + Ve boa + FA Fat (2.2.86) 
The Euler-Lagrange equations for L3 are 
$ =0, 
wt =0, (2.2.87) 
F? =0. 


Thus, the equations for the F“ are trivial, and the F“ are auxiliary variables that do 
not evolve and can be eliminated. They are only needed to close the supersymmetry 
algebra. We also observe that on-shell, i.e., if the equations of motion (2.2.87) are 
satisfied we have 


£Q)X"(t,0%) =—ews + ef 9! (2.2.88) 


so that the supersymmetry algebra closes here without the F“ field On-shell, the 
number of degrees of freedom of the yf field is reduced so that we no longer need 
the F“ fiel in order to restore the balance between bosonic and fermionic fields 
and on-shell, F“ vanishes anyway. 

We may also write things in a more invariant manner. Namely, with 


X(t,0) =o +¥(0)9, (2.2.89) 
we have 
DX =-60-w (2.2.90) 
and 
D(DX) =-(¢+ W8), (2.2.91) 


and so we have 


1 
S}= y 5 DX D(DX)dtdd. (2.2.92) 
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Since the supersymmetry generator Q anticommutes with D, we now see directly, 
without any need for further computation, that L; remains invariant under super- 
symmetry transformations. Similarly, to represent 53, we consider the operators 


Dy = —0% 0; + Ope. (2.2.93) 
From (2.2.83), we obtain 


1 
S3=5 / e° DyY DgY dtdé7do', (2.2.94) 


for the fiel Y, where the antisymmetric €-tensor satisfie 
eVa——-l =], (2.2.95) 


We observe that, in contrast to (2.2.92), (2.2.94) contains only two Ds, the reason 
being that here we have two odd variables, 6! and 62, that are integrated. The Euler— 
Lagrange equations (2.2.87) for L3 then become 


€"? Dy DgY =0. (2.2.96) 


We now wish to include self-interaction terms in the functional and consider a 
(smooth) potential function of the superfield Y“, 


WY") = Wo" + 20% + F40'67), (2.2.97) 


Thus, we have the expansion 


= dw(d) apna apnlp2 1 a°w) agna,, bap 
WP) = wd) + SO (gor + F190) + = sorupor. (2.2.98) 
We introduce an interaction Lagrangian 
Lin. =— / wv(t,6!,67))dtdodo! (2.2.99) 
dwbt)) a , Pwelt)) 44 
I( 36" (1) Fe+ apa(nade(n Wh at, (2.2.100) 
The total Lagrangian is 
L3 + Lint. (2.2.101) 


The corresponding Euler-Lagrange equations include the following equation for F 


pe we) (2.2.102) 
age 


Again, this is an algebraic equation and thus eliminates F“, and we may write 


1dw(¢) dw(d) 92 w() 
2 d6¢ dda apap" 


1... 1 : 
Lat Lin = f dr( 58%ba + 5 Va Wau wivd), 


(2.2.103) 
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2.3 Variational Aspects 
2.3.1 The Euler-Lagrange Equations 


Here, we present a brief summary of the calculus of variations as needed for treat- 
ing action functionals and their symmetries. For more details, we refer to [66]. We 
consider a Lagrangian 


L= f F(.ue),du(x)dx (2.3.1) 


and variations 
u(x) > u(x) + sdu(x); (2.3.2) 


here, s is a parameter, and 5u(x) is the variation of u at x.'* This means the follow- 
ing: 

6L(u) d 

aah F(x, u(x) + sdu(x), d(u(x) + sdu(x)))dx)s=0. 
u S 
(2.3.3) 

More generally, one may consider a C?-family of diffeomorphisms h,(u) of the 
dependent variables, define for s in some neighborhood of 0, with ho being the 
identity, and 


8L(u)(du) = 


d 
—hs(u(x))|s=0 = du(x), (2.3.4) 
ds 
and 
d 
b6L(u)(6u) = & | Foe heute), dis u(x dao. (2.3.5) 


Since we consider only infinitesima variations, (2.3.3) and (2.3.5) are the same, and 
we may use either formulation. 
We now assume that 


- _ dL (u)\ | 
(u)(du)| = a =0 (2.3.6) 


for a variation du. We compute 


dL(u)(6u) = / (F: (x, u(x), du(x)) du(x) + Frye (x, u(x), du(x)) sozbuls) a 


'4The integration is supposed to take place on some domain Q, but as that domain will play no 
essential role, we suppress it in our notation. In many situations, the variations du are required to 
satisfy certain conditions at the boundary of Q (because u itself is constrained there), but again, 
that will not be essential in the present context. Of course, it is important to realize that the integral 
in the definitio of the Lagrangian is a definit one. Likewise, we suppress other constraints that u 
may have to satisfy, and that need to be preserved by its variations. 
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=[(F (x, u(x), i) alle? d sy Fp (x, u(x), au(x)) ous 
(2.3.7) 


where p® is a dummy variable for the place where ie is inserted, and subscripts 
denote partial derivatives, e.g. Fy, := oF (in fact, u might be vector valued, and in 
that case F,, stands for all the partial derivatives of F w.r.t. the components of uw.) 
For the last line, we have integrated by parts, assuming that the variation du is such 
that no boundary term occurs. We note the full derivative Ay that indicates that we 
need to differentiate F,« (x, u(x), du(x)) for all three occurrences of x. 

Comparing (2.3.7) with (2.1.52), we obtain 


bL(u) 
bu(x) 


= F,(x, u(x), du(x)) — sake (x, u(x), du(x)). (2.3.8) 


Thus, qa represents the Euler-Lagrange operator. — 
We now assume that u is a stationary point, e.g., a minimizer of L, in the sense 
that (2.3.6) holds for all variations uv satisfying an appropriate boundary condition. 


We then obtain the Euler-Lagrange equations 
5L(u) 
bu(x) 


When we wish to derive things in a geometrically invariant way, we should change 
the preceding formalism slightly. The reason is that the integration measure dx em- 
ployed in (2.3.1) is not geometrically invariant. More natural is the volume form 


(2.3.9) 


det g;;dx for a Riemannian metric g;;. (2.3.10) 


Thus, in place of (2.3.1), we should consider 


L= f G(x,uex),du(a)/detgijde. (2.3.11) 
We abbreviate 
Je = det gi}. (2.3.12) 
The Euler-Lagrange equations (2.3.9) then become 
pea = G(x, u(x), du(x)) — — Bot ipG iti u(x), du(x))) =0. (2.3.13) 
bu(x) a Shs 


2.3.2 Symmetries and Invariances: Noether’s Theorem 


We consider an action 


L= f F(uex),du(x)dx (2.3.14) 
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that is infinitesimall invariant under some variation 
u(x) > u(x) + sn(x). (2.3.15) 
As just explained, the invariance means that 


6L:= - i F(x, u(x) + s(x), d(u(x) + 5n(x)))dx\s—9 = 0. (2.3.16) 


AY 


In contrast to the preceding, here we consider arbitrary field wu, but only particular 
variations 7—above, we had considered arbitrary variations du for a particular wu. 

Again, we may alternatively consider a C*-family of diffeomorphisms h,(u) of 
the dependent variables, define for s in some neighborhood of 0, with ho being the 
identity, and {hs (u(x))\s—0 = n(x). We now assume 


i F(x, hs(u(x)), dhs (u(x)))dx = / F(x, u(x), du(x))dx 


for all s near 0 and all admissible uw. The interpretation that a variation arises from 
a diffeomorphism of the dependent variables that leaves the action invariant is useful 
when one wants to analyze invariances in the context of global analysis. 

As in (2.3.7), we obtain 


0 
0= / (Fut u(x), du(x))n(x) + Fpe(x, u(x), au(s)) os) ds. (2.3.17) 
x 
We now consider a more general variation 
u(x) > u(x) + s(x) n(x), (2.3.18) 
that is, where the variation parameter s may also depend on x. Since the variation 
5L vanishes for constant s, it must now be proportional to the derivative of s, that 
iS, 
0 
6L= Hie Nestea) GUN) ID gg GR. (2.3.19) 
x 


If we now assume in addition that u is stationary, that is, 6L vanishes for all varia- 
tions, (2.3.19) vanishes as well, and we conclude 


d 
a0 (x, u(x), du(x))n(x)) = 0. (2.3.20) 
This is a special version of Noether’s theorem. We defin the Noether current 


J (x) := Fpa(x, u(x), du(x))n(x). (2.3.21) 


Noether’s theorem thus says that j is conserved in the sense that 


a 
div j = —~ j*(x) =0. (2.3.22) 
ox® 
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As at the end of Sect. 2.3.1, when we consider a functional of the form (2.3.10), we 
obtain the geometric version of Noether’s theorem, 


div j = a ae dae (x)) = (2.3.23) 
with j% (x) := Gpa(x, u(x), du(x))n(x). 
For the general version of Noether’s theorem, we also allow for variations of the 
independent variable x. That means that we consider 


x x’ := x4 56x, (2.3.24) 
u(x) > u(x’) = W(u(x)) = u(x) + sda). (2.3.25) 
When we write 
ul (x) = u(x) + sn(x) (2.3.26) 
we have 
n= dw — aoe (2.3.27) 


Since now the integration measure dx also varies under (2.3.24), the Noether current 
becomes 


d 
j% = Fya(x, u(x), aut (00 7 aa") 4+ F(x, u(x), du(x))dx", (2.3.28) 
x 
and again a conserved quantity, 
div j : i~ (x) =0 (2.3.29) 
= X)=U. J. 
J axe J 
Here as well, in the Riemannian setting, we instead have 
& du B 6 
:= Gpa(x, u(x), du(x)){ dy — ae + G(x, u(x), du(x))dx* (2.3.30) 
Xx 
and (2.3.29) should be replaced by 


div j = aia" (x)) = (2.3.31) 


cf. (2.3.23). 
Finally, in many situations, the Lagrangian is only invariant up to a divergence 
term, that is, 


oe [ovaw. 5x, du) =0. (2.3.32) 
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In this case, we obtain that (in abbreviated notation when compared to (2.3.28)) 
div(Fpdu + (F — Fydu)dx + A(u, dx, du)) = 0. (2.3.33) 


The standard example is the conservation of energy for time-invariant Lagrangians. 
We consider the action 


So= [ (G84 = v«6) Jar (2.3.34) 


Since the integrand does not depend explicitly on ¢, it is invariant under a variation 
t > t+ 6t, and from (2.3.28), we conclude that the negative of the Hamiltonian 
(= energy) is preserved, this being given by 


504 +V(). (2.3.35) 
The same happens for our supersymmetric Lagrangian (2.2.55), 
S1= 5 [Oda + W vada: (2.3.36) 
again, the Noether current is 


io 
56" ba, (2.3.37) 


that is, minus the Hamiltonian. We observe that the Noether current here contains 
only the bosonic fiel @, not the fermionic one yw. We may also consider supersym- 
metry invariance in this framework. When we perform the variations (2.2.68), we 
compute that the associated current is given by 


—edbw. (2.3.38) 


The superspace formalism represented the supersymmetry variation as a variation 
of the independent variables. Since S$; in (2.2.92), however, also contains terms of 
the form D*X, the formalism needs to be slightly extended to carry over. The La- 
grangian $3 as written in (2.2.94) does not present this problem, and so, in that case, 
the conserved current can be computed from a variation of the independent variables 
without the need for an extension of the formalism, except that of course we now 
need to take the signs into account as always when performing supercomputations. 

We now finall derive some implications of Noether’s theorem in the Minkowski 
setting. According to (2.3.29), invariance implies a conserved current: 


aa j% =0. (2.3.39) 


From this, we obtain a conserved charge: 


=i esp (2.3.40) 
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where j° is the time component of j and d¢—!x denotes the integration over a space- 
like slice, 


d is ; 
<= | at's aoj° 
=— ; d‘—!x dq j@ (a running over spatial indices) by (2.3.24), 


=0 (2.3.41) 


by the divergence theorem when j vanishes sufficientl quickly at spatial infinit . 


2.4 The Sigma Model 


In this section, we discuss an action functional that is fundamental to conformal 
fiel theory and string theory, the sigma model and its nonlinear and supersymmetric 
versions. In the mathematical literature, the corresponding theory appears under the 
name of harmonic maps, and we refer to [65] for a detailed treatment with proofs and 
references; for the supersymmetric version, the harmonic map needs to be coupled 
with a Dirac fiel as treated in [16]. A monograph about this topic from physics 
is [73]. 


2.4.1 The Linear Sigma Model 


We let M be a Riemannian manifold of dimension m, with metric tensor in local 
coordinates (Yes )a,p=1,....m,-)> 
We recall the following notation: 


wanes 


y= det(Yop)s 
1 . 
r= ae (¥p6,n + Yn3,8 — ¥Bn,8) (Christoffel symbols). 


For a function ¢ : M > R of class C!, we consider 


dG(x) d(x) 
ax® axBb 


yP (x) (2.4.1) 


'SThe conventions here are different from the ones established in Sect. 1.1.1 where the metric of 
M was denoted by g;;. The reason is that for the nonlinear sigma model, another manifold will 
come into play, the physical space(—time) whose metric will then be denoted by g;;. M will be the 
world sheet instead. 
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in local coordinates (x!,...,x’") on M. The quantity (2.4.1) is simply the square 
of the norm of the differential d@ = aaa”, which is a section of the cotangent 
bundle 7* M, that is, 


Ip (x) OO) 
ax® axB 
in hopefully self-explanatory notation. Therefore, it is clear that (2.4.1) is invariant 


under coordinate changes. 
The Dirichlet integral of @ is then 


yF (x) = (do, dd)r+m = \ldol|* (2.4.2) 


1 1 
S(¢) == sf yP (x) 2 ae of Jyax' Ns as" =f lldp||"dvol,(M). (2.4.3) 


Minimizers are harmonic functions; they solve the Laplace—Beltrami equation 
(see (1.1.103)) (the Euler-Lagrange equation for S(@)) 


1 a a 
Aug =—— “P___) —0. 2.4.4 
a (v7 a) (2.4.4) 
We now specialize this to the case where M is two-dimensional, that is, a surface 
equipped with some Riemannian metric. According to the conventions set up in 
Sect. 1.1.2, we can then let the indices a, B stand for z,Z, where z = x! + ix? is 
a complex coordinate; when we want to avoid indices, we shall also write z = x+y, 
as in Sect. 1.1.2. Thus 
00 9b 009d 2d 90 


+2y% +y% 2.4.5 
iete Gabe Ge ae eae) 


Idol? =y 


and, recalling (1.1.76), 


1 dd dd 
= Ber 2 
S(d) = sf " ax® axB ABV 1122 — Yi dx Ady 


1 8696. 0h 96 5 dG a 
ea over 49 ZZ 
L@& ag oz to’ aca tT’ OE OE 


X Vf VecVez — Vp dz A dz. (2.4.6) 


A fundamental point in the sequel will be to consider S not only as a function of the 
fiel @, but also of the metric y. We thus write 


S(o, y). (2.4.7) 


Naturally, we then also want to study the effect of variations dy of the metric on S; 
it is in fact more convenient to study variations of the inverse metric y~!. Observing 


that \/ 1122 — Vip = OV yy? — (v!2)?)—!, we compute 
5S.) = i Tips"? | Ain = V2, dendy (2.4.8) 
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with 


_ 96 96 1 en 9 Ib 
0B axe axb 27°F" aye ayn” 


(2.4.9) 


We call Tyg the energy-momentum tensor!® and observe that T is trace-free, that 
is, 


T2 =0. (2.4.10) 


Here, the dimension 2 is essential. The reason why T is trace free is the conformal 
invariance of S in dimension 2. This simply means that when we change the metric 
to e° y for some function o : M > R, then S stays invariant: 


S(g,e°v) =S, y). (2.4.11) 


Infinitesimall , the variation of y~! is —Sa y~!, and from (2.4.8), (2.4.11), we get 


0= | Tp ba yl yr2 a Vb dx Ady 
= | Te 804) Vi y22 — Vj dx A dy (2.4.12) 


for all variations 50, which implies (2.4.10). 

We now consider this from a slightly different point of view. A Riemannian 
metric y on a surface induces the structure of a Riemann surface &, as define 
in Sect. 1.1.2, via the uniformization theorem (for a detailed treatment, we refer 
to [64]). As a consequence, we can fin holomorphic coordinates z = x + iy for 
which the metric is diagonal, that is, 


yi2=0 and yi=y22, (2.4.13) 


or equivalently, 
Vez =O = zz. (2.4.14) 


We can then express the metric tensor by a single (nonvanishing) scalar function A, 
that is, as 


dzdz; (2.4.15) 


cf. (1.4.18). As explained in Sect. 1.4.2, a Riemann surface © can be considered 
as a conformal equivalence class of metrics of the form (2.4.15). When we choose 


'6Since we consider a Euclidean instead of a Minkowskian situation, we cannot distinguish be- 
tween temporal and spatial directions, and thus also not between energy and momentum as quan- 
tities that are preserved because of temporal or spatial invariance, according to Noether’s the- 
orem. This explains the name energy-momentum tensor here. In general relativity, the energy— 
momentum tensor emerges because Lorentz invariance combines temporal and spatial invariance. 
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coordinates so that this holds, our functional S also simplifies 


oi d¢ dh dpa ag dg 
S@,y)= ace e + ay ve ax na y= | 25 ae ° ae Kit. (2.4.16) 


Thus, the dependence on the metric disappears, except that the holomorphic coordi- 
nates z = x + iy have been chosen so as to diagonalize the metric. In other words, 
S is a function of the equivalence class of metrics encoded by &, and we can write 
it as 


S(, X). (2.4.17) 


In these conformal coordinates, that is, where (2.4.13), (2.4.14) hold, the condition 
(2.4.10) becomes 


Tz = 0. (2.4.18) 
rr 
a , and so 733 = 
T.,. Therefore T is determined by its component T-,. Taking its the transformation 
behavior into account as well, the energy-momentum tensor becomes a quadratic 
differential 


Since we take the fiel ¢ to be real-valued here, we also have oe = 


Fy 2 
T,,dz = (3) dz” (2.4.19) 
z 


from (2.4.9). 
When the metric takes the form (2.4.15), the Laplace—Beltrami equation satisfie 
by critical points of S also simplifies 


4 ao 
———_ = 0, 2.4.20 
i 0zdz ( ) 
which is equivalent to the simpler equation 
2 
7 a =0. (2.4.21) 
“9202 


In this presentation, the dependence on the metric is no longer visible. This sim- 
ply comes from the fact that we write the equation in local coordinates, and local 
coordinate neighborhoods are conformally equivalent to domains in the Euclidean 
complex line C. When we return to the global aspects, as explained, we have a func- 
tional S$(¢, &) that depends on the Riemann surface &. In Sect. 1.4.2, we have 
considered the moduli space of Riemann surfaces, and we can thus consider S as 
a functional on the moduli space M, of Riemann surfaces of some given genus p. 
As explained in that section, that moduli space is not compact for p > 0, and one 
should then consider an extension of S to a compactificatio of M,. Such a com- 
pactificatio was constructed by pinching homotopically nontrivial closed curves 
(represented by closed geodesics w.r.t. a hyperbolic metric for p > 1), thus creating 
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surfaces with singularities. Those singularities were then removed by compactifying 
the resulting surfaces by two points, one for each side of the closed geodesic. This 
now connects well with the behavior of the functional S, because its critical points, 
the solutions of (2.4.20), (2.4.21), are harmonic functions. And bounded harmonic 
functions can be smoothly extended across isolated singularities. That means that if 
we consider a sequence of degenerating Riemann surfaces ©, and controlled har- 
monic functions uy, (with some suitable norm bounded independently of 7) on them, 
we can pass to the limit (of some subsequence) that then define a harmonic func- 
tion u on the Riemann surface & obtained by the described compactificatio of the 
limit of the Riemann surfaces. That harmonic function is then smooth on all of X, 
and in particular, it does not feel the presence or the position of the puncture, that 
is, of the points added for the compactification In particular, the functional S then 
naturally extends not only to the Deligne-Mumford compactification but also to 


the Baily-Satake compactificatio M, (see Sect. 1.4.3 of the moduli space M,). 
For more details, see [62]. 

There is one point here that will become important below in Sect. 2.5. While the 
equation of motion, our Euler-Lagrange equation (2.4.20), is conformally invariant 
in the sense that the conformal factor = plays no role, the corresponding differential 


operator, the Laplace—Beltrami operator > a is not conformally invariant itself. 

From (2.4.20), we see directly that the energy-momentum tensor as given by 
(2.4.19) is holomorphic at a solution of (2.4.20): 

OT: 

dz 


=0. (2.4.22) 


In conclusion, the oo en tensor yields a holomorphic quadratic differ- 
ential T,,dz? = (2 2)2q2? on our Riemann surface D. 

There is a dene reason why T is holomorphic. As we shall now explain, S 
is invariant under diffeomorphisms, and by Noether’s theorem, this yields a con- 
served current, that is, a divergence-free quantity. That latter equation then turns 
out to be equivalent to (2.4.22). The reason is simply that (2.4.6), or equivalently 
(2.4.16), is invariant under coordinate changes. In mathematical terms, as explained 
in Sect. 1.1.1, this means that we compose the fiel @ with a diffeomorphism h of 
our surface and simultaneously pull the metric y in (2.4.6) or the area form dx A dy 
in (2.4.16) back by that diffeomorphism. In other words, we have 


S(ooh, h*y) = S(¢, y). (2.4.23) 


In the formalism of physics, we move the points in the domain by an infinitesima 
diffeomorphism, that is, a vector field and consider the variation 


x” + ¢€dx% or, in complex coordinates, z + €5z. (2.4.24) 


By (2.3.28), the conserved current is 


2 
— (2) 82, #--(#) ie (2.4.25) 
Oz dz 
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and with j. = y-:j* (cf. 1.1.2 and note that y,, = 0 by (2.4.14)), (2.3.31) becomes 


2 
0= 2 = (3) 5z. (2.4.26) 


When we take holomorphic variations, bz = 0, that is, respect the Riemann sur- 
face structure, this becomes (2.4.22), the holomorphicity of the energy-momentum 
tensor at a solution of the Euler-Lagrange equations, that is, (2.4.20). 

We now wish to connect this discovery with 7 in Sect. 1.4.2. There, we had also 
found a holomorphic quadratic differential as a (co)tangent vector to the moduli 
space of Riemann surfaces. When we consider S(¢, y) as a function of the met- 
ric y, its derivative with respect to y should be a tangent vector to the space of 
all metrics on our underlying surface. Here, we have been considering variations 
with respect to the inverse metric y~!, and thus, we obtain a cotangent instead of 
a tangent vector to the space of metrics. In 7 of Sect. 1.4.2, we have distinguished 
three types of variations of metrics, the ones through diffeomorphisms, the ones by 
conformal factors, and the residual ones that correspond to tangent directions of the 
Riemann moduli space. Now our functional S(@, y) is invariant under the firs two 
types of variations: diffeomorphism invariance led to the holomorphicity (2.4.22), 
and conformal invariance made the energy-momentum tensor trace-free, (2.4.18). 
Therefore, it must correspond to a cotangent direction of the Riemann moduli space, 
and thus the agreement with the condition (1.4.20) is no coincidence. 


2.4.2 The Nonlinear Sigma Model 


In the nonlinear sigma model, the fiel_@ takes its values in some Riemannian man- 
ifold N with metric g;;, instead of in the real line R. In the physics literature, one 
is usually interested in the case where N is the sphere S”, that is, a homogeneous 
space for the Lie group O(n + 1) (one then speaks of the nonlinear O(n + 1) sigma 
model), or more generally, where N is the homogeneous space for some other com- 
pact Lie group. The case where N itself is a compact Lie group G leads to the 
Wess—Zumino—Witten model (see for instance [38, 73]). For the mathematical the- 
ory, however, one can consider an arbitrary Riemann manifold N, and this generality 
should make the structure more transparent. In fact, this will also be necessary for 
the applications to Morse theory presented below. 

The action functional for the nonlinear sigma model is formally the same as 
(2.4.3), 


1 
S@=5 / l|dp||"dvol(M), (2.4.27) 
M 
where the norm of the differential is now given by 


ag! (x) AG! (x) 
ox@ axBb ¢ 


ldp||? =v (x) 81; (@(@)) (2.4.28) 
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Expressed more abstractly, the differential of ¢, 


ag! F) 
dx® a 
axe . a¢' 


do= (2.4.29) 
is now a section of the bundle T7* M @ @—!T N where the latter bundle is the pullback 
of the tangent bundle of the target NV by the map @. Since the bundle thus depends on 
the fiel _@, the situation is intrinsically nonlinear. In particular, the Euler-Lagrange 
equations are also nonlinear: 


i a k 
T@):= ah = 


1 
JV ox ax aa ( ax 

(2.4.30) 
with the Christoffel symbols as in (1.1.60). (The expression t(@) is called the ten- 
sion fiel of ¢.) Whereas this nonlinearity makes the analysis more subtle and much 
harder, see [65], most of the formal aspects remain unchanged when compared with 
the linear version of Sect. 2.4. Solutions of (2.4.30) are called harmonic maps in the 
mathematical literature. 

We are again interested in the situation when the underlying domain M is a Rie- 
mann surface. As in the linear case, the action (2.4.27) is conformally invariant, 
and so we can consider it either as a function S(¢, y) of the domain metric y or as 
a function S(¢, ©), with the Riemann surface & considered as the equivalence class 
of conformal metrics that y belongs to. Thus, conformal invariance is preserved in 
the nonlinear case, and so is, obviously, diffeomorphism invariance. Therefore, we 
again obtain a holomorphic energy-momentum tensor as before, 


dp Io a> io 4 
Er -(%. se) ae = sii) 


Viv a6!) + YP OOO 5 3 a! 


(2.4.31) 


where we use the scalar product define by the metric of N. When one does the 
computation right, it is the same as in the linear case and therefore need not be 
repeated here. 

For example, from (2.4.8), we also see that S is critical for variations of the metric 
y when the energy-momentum tensor vanishes. According to (2.4.19), this means 


ap dp\ _ 
(2. xs) -~0 (2.4.32) 


(note that we ae not taking a Hermitian product here, and so this quantity can well 
be 0 without %¢ ® being 0 itself—when that happens, we say that “ is isotropic), or 
in real cootdinsies x,y with z=x+iy, from (2.4.9) 


a6 dg\ _ [ad ae ad ao\ _ 
(ee) =e), (Fe) =o. a 


When those relations hold, the map ¢: & — N is conformal. Since by a special 
case of the Riemann—Roch theorem, see Sect. 1.4.2, every holomorphic quadratic 
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differential on the sphere S* vanishes, we conclude that on S*, the energy— 
momentum tensor associated with a harmonic map automatically vanishes, and 
therefore, any harmonic map ¢: S* — N into any Riemann manifold N is con- 
formal. For Riemann surfaces of genus > 0, this is not true. 


2.4.3 The Supersymmetric Sigma Model 


We now extend the sigma model to include supersymmetry, proceeding as in 
Sect. 2.2.5. We work with the Clifford algebra C/(2,0), which admits a real rep- 
resentation as explained in Sect. 1.3.2. In fact, this representation is a dimensional 
reduction of that of C7(2, 1), and so the two-dimensional formalism to be developed 
here is a dimensional reduction of a three-dimensional one. As explained in [23], 
a three-dimensional space (with Minkowski signature) is the basic setting for N = 1 
supersymmetry, but for our purposes, conformal invariance is a crucial underlying 
feature of our variational problems, and therefore, we continue to focus on the two- 
dimensional case and consider (2|2) dimensions here (with Euclidean signature). 
We choose local even coordinates x!, x2 and odd ones 6!, 6. 

In order to conform to the conventions employed in [23], we use the following 
representation of C/(2, 0): 


-1 0 OG <1 
asr=(% hs asr=(2 .) (2.4.34) 


which is different from the one described in Sect. 1.3.2 (but of course equivalent to 
it). We recall that the y” satisfy 


{y4, y"} = 288”. (2.4.35) 


This is a real two-dimensional euclidean representation, and so we have real euclid- 
ean Majorana spinors satisfying 


vawvi=v' =(, Wy). (2.4.36) 


In particular, we could leave out the bars for complex conjugation (and we shall do 
so sometimes). Since these spinors are supposed to anticommute, we also have 


Wx=—-xy, wy x =—y'y. (2.4.37) 


For a spinor fiel yw, we then have the Dirac form 


a cia 
- Ox Ox 
Dw =(W, : 2 
wDw = (v1, W2) |v as +Y ads 

ax! ax? 

V1 Ch) OW2 OW 
= 2.4. 
V1 axl + 7) aul V1 ax? Wo ax? (2.4.38) 
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For later purposes, we observe that this can also be expressed in complex notation 
as 


UDy = -(on fe ie) + iy) + 1 — ia) Cn = ivn)) (2.4.39) 


with z=x! +ix?. 
We defin the vector field 


Dy := 49: — 0'8,1 — 08,2, 


(2.4.40) 
Dy := dg. — 01 8,2 + 07,1, 
01 := 91 + 0!0,1 +079,2, 
(2.4.41) 
Q> := dg. + 018,2 — 078, 1. 
They satisfy 
[D,, Dj] = —20,1, [D,, D2] = —20,2, [D2, D2] = 20,1, 
(2.4.42) 


[Q1, O1] = 20,1, [Q1, O2] = 20,2, [Q2, 02] = —20,1. 


We are now ready to introduce the supersymmetric sigma model. We consider a su- 
perfiel Y with expansion 


Y = (x) + Wa(x)0% + F(x)o!07 (2.4.43) 
and the action 
1 
S4= i qo (De, DpY)d?xdo7d@! (2.4.44) 


where dé indicates that a Berezin integral has to be taken; namely, we recall from 
Sect. 1.5.2 that for an expression Z = z + zy0% + z120!07, 


/ Zd°0 = / Zd07do! =z, (2.4.45) 


that is, the 6-integration picks out the 6!6* term, see (1.5.27). Moreover, the anti- 
symmetric €-tensor is define by 


«P= =], (2.4.46) 


S4 is the Wess—Zumino action (in fla Euclidean space). After expanding and carry- 
ing out the Berezin integral, this becomes 


1 - 
Si=5 / dx(0!'b Qube + Vy" utha +P" Fa). (2.4.47) 
In complex notation, this looks as follows: We set 


We =i —iyn, v= vitin (2.4.48) 
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and 
04 := 0; + i60, 6_ := 0, — i) (2.4.49) 


and defin the operators 
Dis := 06, +6482, D_ := 06_ + 0_4;. (2.4.50) 
With this notation, (2.4.43) becomes 
I . 
Y= + 5(H+0s + ¥0-) + = F040. (2.4.51) 
We then have 


1 
S4= i 5 D-YD,Yd"xd6_d6., 


= [ 5(42.0%%64 Wy ole wv a + FIR, |x, (2.4.52) 
2 Oz 0z 


Sq is invariant under the supersymmetry transformations 


5b" =F", (2.4.53) 
by" = yd," —2F*, (2.4.54) 
5F¢ = ey" dy". (2.4.55) 


Indeed, the variation is 
5S4 = 5 | 4x2 280" 9G, Hey dO y"OyWa +EF "Oy Wa 
+ Wy Oily’ edva + &Fa) — yew" Fa + Ou (E)W1d" ga), (2.4.56) 
which vanishes after integration by parts and using (2.4.35) and (2.4.37), when we 
assume that € is constant. Locally, the latter can be assumed, and we do so for the 


moment, but later on, in Sect. 2.4.7, we shall return to the global issue. 
The Euler-Lagrange equations for $4 are 


e*? Dy DgY =0, (2.4.57) 
or in components, 
Ag =0, (2.4.58) 
y" dup =0, (2.4.59) 
F=0. (2.4.60) 


In complex notation, these equations become 
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D_D,Y =0, (2.4.61) 
or in components 
0,026 = 0, (2.4.62) 
dzw+ =0, dw =0, (2.4.63) 
F=0. (2.4.64) 


Again, F' is a nonpropagating, auxiliary fiel that is only introduced to close the 
supersymmetry algebra off-shell. On-shell, (2.4.53) and (2.4.54) become 


bp =e", (2.4.65) 

byt = y"ed og". (2.4.66) 

(Note that (2.4.59) implies that 6F“ = 0 on-shell, i.e., the term that obstructs the 
closing of the algebra is proportional to one to the equations of motion.) 

We now turn to the supersymmetric nonlinear sigma model. In fact, the formal- 


ism remains the same; we just need to expand its interpretation. 
Thus, we consider a map 


Y:M->N (2.4.67) 


from a (2|2)-dimensional supermanifold to some Riemannian manifold N. We ex- 
pand Y as before: 


Y =(x) + Wa(x)o® + F(x)0!6?. (2.4.68) 


@ can be considered to be an ordinary map into N, whereas the odd part w repre- 
sents an (odd) section of the pull-back tangent bundle ¢*7N.!7 (., .) now denotes 
a Riemannian metric on the target space; we shall also write ||v||? := (v, v) be- 
low. 

Finally, F is an auxiliary fiel as before. This time, the algebraic equation for F 
among the Euler-Lagrange equations is 


. —k . —k . 
—4g),(@)F' +2gijnw W' — giv vw =, 
1.6: 
Fiariv wi, (2.4.69) 


so that F can again be eliminated. In particular, when we use Riemann normal 
coordinates at the point under consideration, F vanishes. 


'/ Thus, w.r.t. coordinate changes on the target N, y transforms as a vector, whereas on the do- 
main M, it transforms as a spinor. In particular, the setting here is different from the one above 
in Sect. 1.5.3 for maps between super Riemann surfaces, where the odd fiel has to transform as 
a spinor on both domain and target. 


162 2 Physics 


In local coordinates, after carrying out the 0-integral, the Lagrangian density 
becomes 


slaei? += 5 Dy) - Te (a, Rp, Vy) Wa). (2.4.70) 


(We get the R-term (the curvature of the target NV) after elimination of 5 || F||?.) This 
comes about as follows: According to the rule for the Berezin integral, we need to 
identify the 6'6?-term in (2.4.44). For that purpose, we recall that a function of 
a superfiel Y has to be expanded by Taylor’s formula as explained in Sect. 1.5.2, 
see (1.5.20), (1.5.21). In particular, (2.4.44) contains the metric tensor (., .) of the 
target N. In local coordinates, we have a tensor g;;(Y) whose expansion contains 
second derivatives g;;,47(¢) multiplied with 6 ‘W107 W2, which gives the curvature 
term in (2.4.70). Terms with firs derivatives of g;; do not carry an invariant meaning 
and become 0 in suitable coordinates (Riemann normal coordinates) at the point in 
N under consideration. In particular, the curvature tensor R has to be evaluated at 
the point #(x) € N. Similarly, the Dirac operator DP contains a covariant derivative 
at the tangent space Ty.) N. 

We now list the important results for the nonlinear supersymmetric sigma model 
(for detailed computations see [17]): The Euler-Lagrange equations for the nonlin- 
ear supersymmetric sigma model are 


1 si 1 eat 
t™(G) — Rij (Vol - wl) + 8 Riki: ob wy) =0, (24.71) 


Dy” — =R uve yl = (2.4.72) 


The firs equation generalizes (2.4.30). 
The functional S is invariant under the supersymmetry transformation 


i ogi 
(oy) yo - Pi. Cewyyk. eae) 
As in (2.4.55), we recognize the F-term, see (2.4.69). 
The supercurrent 
J% :=2g:dg¢' yPy*w), «=1,2 (2.4.74) 
is conserved (on-shell), i.e., 
Dy J* =0. (2.4.75) 


Again, we wish to consider an interaction Lagrangian with a superpotential W 


Sint = / W(Y (x, B))d*xd70. (2.4.76) 
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A simple and standard choice is 
La 
W(Y)= =,iF — XY, (2.4.77) 
with parameters k, A. The coefficien of 6'6? in the expansion of W is 


—\F —kF¢? —kywe. 


We firs consider the linear sigma model, that is, we set the curvature tensor R = 0. 
In the Lagrangian S4 + Sinz, we then have the F terms 


Dis 2 
se + AF +kFO", 
leading to the algebraic Euler-Lagrange equation 
F=-)—k@’. 
Utilizing this equation, the Lagrangian becomes 
> (1 ‘i Le i 1 nd 
Sat Sine = f dx 50,68" O+ SY" Ou — 5A + key’ — kw we J. 


A more general interaction term is of the form 
1 a | ee a a a eee 6 ae 
int =~ | h(Y)d = J -—— ——— J )d 
Sint >| (Y)d*xd°0 ( 78 ) api api Dagiagi” x 
(2.4.78) 


where the firs term in the integrand arises from eliminating the auxiliary fiel F in 
the combined Lagrangian, in the same manner as before. 


2.4.4 Boundary Conditions 


We start again with the bosonic fiel @ that takes its values in some d-dimensional 
Riemannian manifold N. We now assume that ¢ is define on some Riemann sur- 
face with boundary. The surface will again be denoted by X, and we assume for 
the moment that its boundary is a smooth curve y, or a collection of such curves. 
Boundary conditions for @ on y = 0 are given by specifying a smooth submani- 
fold B of dimension p of N. In the physics literature [86, 87], this would be called 
a brane or a D-brane, with D standing for Dirichlet boundary conditions.'* Lo- 
cally, we can choose coordinates on N so that B is given by x?t! =.--= x4 =0. 


'8In fact, the physics convention is to consider d — 1 spatial and one temporal dimensions. 
A p-brane would then result from f xing p of the d — 1 spatial dimensions. 
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Fig. 2.1. The boundary conditions for bosonic and fermionic f elds define by a D-brane 


The boundary conditions are illustrated in Fig. 2.1 and will now be described in 
formulae. 

The firs part of the boundary conditions requires that the boundary curve y be 
mapped to B. Locally, this therefore means 


gPtl=...=¢4=0 ony. (2.4.79) 


This is, of course, a Dirichlet boundary condition for the components $?*!, ..., 64. 
More generally, the tangential derivative 2 of these components has to vanish 
ony, 

a ro) pt+l _ - a of 


ee eee ; 2.4. 
a7 a7 0 ony (2.4.80) 


The remaining components then should satisfy a Neumann boundary condition, that 
is, with & denoting the normal derivative on y, 


ap° ap? 
Se SO Gas (2.4.81) 
ov ov 

In order to state the boundary condition for the fermionic fiel w, we use the no- 


tation of (2.4.48). We firs assume that at the point under consideration, the metric 
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of N is given in normal coordinates, that is, g;; = 6;;. Then the general boundary 
condition for w is in these coordinates: 


vw! =4tyi. (2.4.82) 


A choice of sign in (2.4.82) can be motivated by the following consideration. As our 
domain, we consider a strip 


{o €[0, 27]} x {t € [0, T]}, (2.4.83) 


and we wish to fi boundary conditions for o = 0, 27 (since the r-direction is inter- 
preted as a temporal direction, there might be initial conditions prescribed at t = 0 
and fina ones at t = T, but this is not our concern here). We assume that we have 
a mapping @ define on this strip, and a vector y along @. We look at the sim- 
plest situation, where N is Euclidean space R?, and the brane receiving o = 0 is 
the hyperplane x¢+! = ...x¢ = 0. For o = 2z, we prescribe another brane that is 
parallel to the firs one, say x4+! = ...x4~! = 0, x4 = R. When we then periodi- 
cally identify these two branes, that is, dividing R@ by the translations by R in the 
x4-direction, in order to get the field y/ to match on the boundary, we need to 
require 


w*Qn,t)=w*(0,t) fork=1,...,p, (2.4.84) 
and 
wv Qn,t)=—-w'(,t) for€=pH+l,...,d. (2.4.85) 


When we then put wi = w and defin w_(o,t) = W,(2m — o,T) (in which case 
the fiel wy on the range o € [0, 277] is obtained from the two field ws on half the 
range, o € [0, 7]), we then obtain from (2.4.84) 


wl=wWi forj=l,...,p, and we =—-wk fork =p+l,...,d. (2.4.86) 


Thus, the plus sign in (2.4.82) corresponds to Neumann boundary conditions for 
the corresponding components of ¢, and the minus sign to Dirichlet conditions. See 
also the discussion in Sect. 2.6.3, around (2.6.47). 

In general coordinates, the boundary conditions for the w-fiel can be written as 


wv! =D! yi, (2.4.87) 
with the tensor Di satisfying 
D! Di, = 63 (2.4.88) 
and 
gij = Di Di gre. (2.4.89) 
Thus 
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is symmetric. Again, there is a sign to be determined, according to Neumann or 
Dirichlet type boundary conditions for @. 

The preceding boundary conditions arise as follows. When we consider varia- 
tions 6¢, dy of the field in S4 (neglecting the F-fiel as this vanishes on-shell 
anyway), we get a corresponding boundary term in the induced variation 6$4. Em- 
ploying the version (2.4.52), this boundary term is given by 


1 ab* 
5 , si( 50! - +iSui wk — byl wk) + sbi Tk wom — vw) dg 
(2.4.91) 


where v as before is the outer normal direction at y and & is a coordinate on y. 
These boundary terms then vanish if 


1. either 
6¢/ =0 (Dirichlet) (2.4.92) 
or 
agk 
Fis 0 (Neumann) (2.4.93) 
2. and 
wi =D) vi, (2.4.94) 


that is, (2.4.87) holds, with the conditions (2.4.88) and (2.4.89). 


In order to keep the theory supersymmetric, the brane B should be totally geodesic, 
that is, every shortest geodesic in N connecting two points in B should already be 
contained in B, see [1]. 


2.4.5 Supersymmetry Breaking 


The Hilbert space of a quantum fiel theory can be decomposed as 
H=Ht eH 


with H* (HH) being the space of “bosonic” (“fermionic”) states. The theory is su- 
persymmetric if there are (Hermitian) supersymmetry operators 


Q,:HoOH, i=1,...,N 


with 


OF faa) eae (2.4.95) 


Witten [104] introduced the operator (— 1)” satisfying 


(-1)Fx =+x for x €H*. (2.4.96) 
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The supersymmetry operators Q; then anticommute with (—1)" 
(-1)"0;+0,(-1)* =0 G=1,...,N). (2.4.97) 


The Q; must commute with the Hamiltonian H that generates the time translations, 
1.€., 
O;,H-HQ;=0 (=1,...,N). (2.4.98) 


The Q; are then determined if we require additionally 


QO? =H, Q;0;+0;0;=0 foriFj. (2.4.99) 


As a square of Hermitian operators, H is positive semidefinite 
Let |b) € H* satisfy 
H|b) = E|b), 
i.e., |b) is an eigenvector H with eigenvalue E (> 0 as H is positive semidefinite) 


We consider one of the Q;, which we simply write as Q for the moment. 
We can write 


Q\b) = VE|f) 
and get 
eee ee eee = 
Olfi= FeO Ib) = Feb) = V EIB). 


Thus, if E 4 0, the bosonic and fermionic eigenstates with eigenvalue E are paired 
in an irreducible multiplet of the supersymmetry algebra. This need not be so any 
longer if E =0. Since H = Q?, if 


H\|bo) =0_ for |bo) € Ht 


we have 
0 = (bo|H|bo) = || Q|bo) ||? (since Q is hermitian), 
hence 
Q|bo) = 9, 
and similarly H| fo) = 0 for | fo) € H{~ implies 
Q| fo) = 0. 


Thus, the zero eigenvectors of H are supersymmetric, that is, invariant under the 
supersymmetry operator. 

Consequently, for positive energy E, the number of bosonic eigenvectors equals 
the number of fermionic ones, but this need not be so for zero energy. 

Let 


v(0) :=#bosonic — # fermionic zero eigenvectors. 
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Regularizing the trace of (—1)", we then have 
Tr(—1)* = v(0). (2.4.100) 


If v(0) 4 0, there must exist at least one—bosonic or fermionic—state with zero 
energy. Since 0 is the smallest possible value of the energy, such a state furnishes 
a vacuum that is supersymmetric. If there does not exist a supersymmetric vacuum, 
ie., if the smallest eigenvalue of H is positive, one says that supersymmetry is 
spontaneously broken. 

We firs consider the supersymmetric point particle with two odd variables with 
the total Lagrangian (see (2.2.86), (2.2.99), (2.2.101)) 


(au 2 ew 
2 ne) d¢? 


tot Lin = f ar( 56d sve viva). (2.4.101) 


The Hamiltonian is 


1 1/dw\* dw 
H==—p’ : 2.4.102 
5P +3(2) + apie ( ) 


As before, see (2.4.77), we choose 


1 
w(¢) = —xkoe — Ao. (2.4.103) 
We obtain 
i oe ey 2 
H= sp t+ sho +a) + 2kowi dn. (2.4.104) 
wy, and w2 are Grassmann valued and odd, and so 
(Wa. Vel = Vale t+ vee =0 fora, B =1,2. (2.4.105) 


After quantization, we get, in place of (2.4.105), 


[Va, Wp] = Moog, (2.4.106) 


that is, the Grassmann variables become Clifford algebra valued. 
We may thus represent the yy by Pauli matrices 


Va = [in Ow (2.4.107) 


1 1 
is a + 5 koe +a)? + hko3¢. (2.4.108) 


and get 


At the so-called tree level (that is, keeping only the zeroth-order terms (those that 
are not proportional to i”, n > 0)—the higher order contain corrections of the tree 
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level), the ground state energy is determined by the potential (k@* + A)*. Hence, 
supersymmetry is spontaneously broken if a >0. 
The supersymmetry generators here are 


O1 = eloip + olk6? +2), Or = slonp — atk p? +2)). (2.4.109) 


For the sequel, it will be convenient to switch to the operators 


(Q1 +iQp). (2.4.110) 


Then 


O7 =0, [O,, O-]=2H. (2.4.111) 


In a cohomological interpretation, we call a state |s) € H( with 


Ox+ls) =0 
closed, one that can be written as 
ls) = Q4|t) forsome |t) e H 


exact. Since O7. = 0, exact states are closed. Conversely, if |sz) is a closed eigen- 
vector of H with eigenvalue EF + 0, i.e., 


H|sz) = E|sez), 


then |tg) := + O-\sz) satisfie 


1 1 
O+ltz) = | O+O0-Isz) = = (0+, 0-Ilsz) (Q-Q+|sz) =0 as sz is closed) 


1 
= —H = ; 
E Ise) =|Sz) 


and hence |sz) is exact. 
If EF = 0 and if we had again |s9) = Q+|fo), then also 


H\to) =0 as[Q4,H]=0. 
However, by (2.4.99), this implies Q1|to) = Q2|to) = 0 as above, hence also 
|so) = Q+|to) =9. 


Thus, the nonvanishing eigenstates for E = 0 are precisely the non-exact closed 
states. 
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2.4.6 The Supersymmetric Nonlinear Sigma Model 
and Morse Theory 


We return to the supersymmetric nonlinear sigma model. We let N be a com- 
pact Riemannian manifold. We assume that the so-called world sheet, the two- 
dimensional domain on which the field are defined is of the form 


{(t,x):teR,xeS'j, 


i.e., a cylinder, whose circumference we assume to have length L. We also assume 
that the field ¢' and w’ are independent of x. We then get 


1 ag! ag! = ea gee he 
Ss= st f a(s ee + gi (PW yaw + FR WP v"), 
(2.4.112) 
After quantization, the spinors y' and their Hermitian conjugates become Clifford 
algebra valued, i.e., 


Woewl=0=" wi], Ww 1= 8). 
Also, after quantization, supersymmetry is generated by the charges 
O+=i0' paw" Dy, Q-=—iv' pi =—-W'Dy, 


with Dy being a covariant derivative, the momentum conjugate to ¢'. 
We now recall from Sect. 1.3.2 that we have a representation of the Clifford 
algebra on the space of spinors given by 


wi _ e(dx/) (e(dx/ ) operates as the exterior product 
with the differential form dx/), 
wi ~ i(dx!) (i (dx!) operates as interior contraction with dx'). 


(This representation is obtained from the one in Sect. 1.3.2 by setting the imaginary 
parts of the differential forms to 0.) 

Thus, y/” corresponds to a differential form, y! to a vector fiel on N (here 
x!,x?,..., are local coordinates on N; one should write o!,7,... in place of 
x!,x?,..., but expressions like d@! look a bit awkward). 

Moreover, Q4+ then corresponds to the exterior derivative d, Q_ to its adjoint 
d*, and the Hamiltonian is 


H=Q,Q0-+0_Q0,=dd*+d*d, (2.4.113) 
the Hodge Laplacian (1.1.109). With Q1 :=d+d*, Q2 :=i(d —d*), we also have 


Hi; 05, (2.4.114) 
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On the other hand, we had interpreted y/ * as a fermionic creation operator, y! as 
a fermionic annihilation operator. The states that are annihilated by all vi , 1e., the 
states with no fermions, are then identifie with the functions f(x) on N. Oper- 
ating on such a state with a y’, we obtain a state with one fermion, or in the de 
Rham picture (see the discussion at the end of Sect. 1.1.3), a one-form on JN. States 
with two fermions must be antisymmetric in the fermionic indices, because of the 
fermion statistics, and can be considered as two-forms. 

Thus, we obtain the de Rham complex, with the Hodge Laplacian. The dimension 
of the space of zero states of this Laplacian, i.e., of harmonic g-forms, is the Betti 
number bg. 

Equating the two pictures gives Witten’s result [104] 


Tr(—1)* = }\(-1)%bq(N). 
qd 


We now add our self-interaction term Lj; with Morse function sh (s here is a para- 
meter) to the Lagrangian Ss. (A smooth (twice continuously differentiable) function 
h is called a Morse function if at all its critical points the Hessian, that is, the matrix 
of its second derivatives, is nondegenerate, that is, does not have 0 as an eigenvalue.) 
This changes d, d* to 


ds =e de, d*¥ =e d*e7*s, (2.4.115) 


We have d? = 0 = d*?, and we get 


O15 =d,+d*%, Q2,5 =i(ds — d=). (2.4.116) 
Moreover, 
Hs = O15 =O, 
= dd; + dsds 
dh oh ah 


= dd* + d*d+5s*gii -[e(dx'), i(dx/)]. (2.4.117) 


ax! axJ i ax! dx/ 
s7 gil al =. is the potential energy, and it becomes very large for large s, except in 
the vicinity of the critical points of h. Therefore, the eigenfunctions of H; concen- 
trate near the critical points of h for large s, and asymptotic expansions in powers 
of i for the eigenvalues depend only on local data near the critical points. This is 
the starting point of Witten’s approach to Morse theory [105], which we shall now 
discuss. 

As mentioned, we assume that / is a Morse function. We let gi, q2,.-., qm be 
the critical points of h. By the Morse lemma (see e.g. [65], p. 311), each critical 
point q, has a neighborhood U,, with the property that in suitable local coordinates 
X=Xy= cam ..., X10) with x, (gp) = 0, 


1 n 
h(p) — h(qv) = 5 Yo Moxy (P)” (2.4.118) 
k=1 
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with 


D?h(qv) = diag(Wy,1, --. 


(1.e., the Hessian of h at g, is diagonalized, and the diagonal elements jz), ... 
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ae) (2.4.119) 


»>My,n 


are nonzero as h is assumed to be a Morse function). 


Also, on Uy we choose a fla Riemannian metric g, for which the 


1,...,”, are orthonormal. 


Of course, we may assume that the U,, v = 1, 


ra) 7 
pi = 


Ox} 


...,M, are pairwise disjoint, and 


moreover that their closures are contained in pairwise disjoint open sets V,. 


is then an open covering of N, and we may fin 
{nv} $1, that is, functions ny : N > R satisfying 


with n, = 1 on U,. 
We choose any metric go on Vo and put 


m 
§ = oS Nv8v- 
v=0 


6 Vin 


a subordinate partition of unity 


suppny CV, 


g is then a Riemannian metric on N. Since neither the Betti numbers of N nor the 
critical points of A or their Morse indices depend on the choice of a Riemannian 
metric on N, we may work with the metric g in the sequel. In this metric, we have 
on U,(v =1,...,m) 


n 2 
a ee , 
(-(=5) +5213 jx! +“, jleldx, ide). (2.4.120) 


In particular, H; is an operator with separated variables on U,. In fact, we have 


n 
/ ] . 
H, = Y(s" + wk") (2.4.121) 


j=l 
with 


(2.4.122) 


2 
Qe A), ate 
Ss ra axl My j ’ 


K/ :=[e(dx/), i(dx/)]. (2.4.123) 
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The operators Q, = Q! ” are just Hamiltonians for harmonic oscillators, and they 
have eigenvalues 


Sly, j](+2N) (N=0,1,2,...) (2.4.124) 
with eigenfunctions 
On (x) = Pu (/siavjlajen 2a, (2.4.125) 


where the Py are Hermite polynomials. In particular, for large s, the @y rapidly 
decay away from x = 0, i.e., away from the critical point g, of h. Moreover, we 
have 


Ki dx™ A... Adx™ = e% (dx™ A... Adx™), (2.4.126) 
with 
1, iffjea=(q... 
ge De BP SOS ante), (2.4.127) 
J —1, otherwise 


ie., K/ has eigenvalues +1. H, thus is a self-adjoint operator with eigenvalues 


n 


s> (1 +2Ny,)|Mv,jl + 8, jHv,j)» (2.4.128) 
j=l 


€y,j =1, Ny; =0,1,2,..., and orthonormal eigenvectors 


eel /slty,j12") 
Ny Ay vi ( Vij 


n 
x 7” 3S pnt Jat Aad iam 
j=l 


(with a, = (ay,1,...,@y,r)). (2.4.129) 


In order for an eigenvalue to vanish, we necessarily have N,,; = 0 for all j, and 
moreover €,,; and 4,,; have opposite signs. Thus, if p, has Morse index p, i.e., 
precisely p of the j,,; are negative, then p out of the ¢, ; must be positive, 
which means that the corresponding eigenvector is a p-form, as can be seen from 
(2.4.126) and (2.4.127). Thus, if a, is a critical point of Morse index p, it has a one- 
dimensional contribution to the nullspace of Hs operating on p-forms, while for 
different Morse index, there is no contribution. 

Now this has been a local consideration, and a nulleigenvector on U,, need not 
extend to a nulleigenvector on all of N. However, a perturbation argument (see, 
e.g., [57] or the monograph [15] for details) shows that the other eigenvalues of H; 
on A?(N) diverge as s tends to oo, while the global nulleigenvectors concentrate at 
the critical points and therefore lead to local nulleigenvectors as considered above. 
We conclude the basic theorem of Morse: 
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Theorem 2.1 
Mp = dp, (2.4.130) 


where mp is the number of critical points of h of Morse index p and by is the pth 
Betti number of N. 


(bp is the dimension of the kernel of the Hodge Laplacian dd* + d*d on Q?(N), 
and one easily sees that the dimension of the kernel of the perturbed Laplacian 
H, = dsd* + dd, is the same for all s.) 

Of course, this is an asymptotic argument, for s — oo, and we only get expan- 
sions of the eigenvectors, in contrast to the original case s = 0 where we could iden- 
tify them with harmonic forms. However, here already the classical, i.e., not quan- 
tized theory, is not entirely trivial; namely while for s = 0, minima of the bosonic 
part of the action S5 were simply constants, for s > 0 the situation becomes more 
interesting. In a sense, the Morse function breaks the symmetry that all points of N 
are equal. 

We consider the bosonic part of our action S5 + Sjn;, again on a cylindrical world 
sheet {(t,x) :t € R,x € S'}, and assuming that the field are independent of x so 
that we can carry out the x-integration. We then have the total energy or Hamil- 
tonian, see (2.1.7), (2.1.9), 


1 dq! dg! Oh Oh 
Hp(¢) = tf aloe + 52g!i(g) a aa (2.4.131) 


Obviously, Hg (¢) = 0 if @(t) = qv, where gq, is a critical point of h. 
These are the classical solutions. We next consider tunneling paths or so-called 


instanton solutions between such classical solutions. 
Given two critical points q,,q,, we have to fnd 


g:R-N; lim @(t)=q, lim $(t) = qu (2.4.132) 
t—>—oo too 


minimizing 


1 dd’ dd! 23 oh oh 
Hp(¢o) = st f ar(s SE + s2¢%@) ) 


dt dt agi agi 
1 dq! .. oh |? d(ho¢) 
—--Lid + so) —_ L 
2 / ‘lar 8 agil| *° fa dt 


> tsL( lim h(@(s)) — lim h@O)) 


=xtsL(h(qv) —h(qu)), (2.4.133) 
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do) 1 | ij any doe 1 ook ah 


dgi a 
gi ij Dh |)2 — gip( ak Sg agi 6 dt x= Sg 3D)" 


a x Sg agi 
Equality occurs precisely if 


(using the simple relation || 


d¢' ij Oh 
= tg!) —, 2.4.134 
a 8 agi ( ) 
1.€., 
d 
< =¥5(Vh) og 


(2.4.134) means that, up to sign, @(t) is a curve of steepest descent for h. 
Thus, the minimum action paths between any two critical points are paths of 
steepest descent, and the action of such a path is 


sL|h(qv) —h(qu)I- 
We now let g be a critical point of h of Morse index p, and we let r1,..., 7 be the 
critical points of Morse index p + 1. We put 


m 


5lq) = D5 ng. ry)Iru). 


w=1 


Here, we associate to each critical point q of index p a basis vector |g) of a vector 
space V,. We put 


N(4,Tu) = s np 


Pp.) 


where I(r, q) is the path of steepest descent from r,, to g, and where np is +1 
according to the following rule. 

By the above considerations, each critical point g of index p corresponds to a p- 
form localized near that point, and this p-form yields an orientation of the subspace 
of T, N spanned by the p negative eigendirections of the Hessian of h at g. Atr,, we 
thus have a (p + 1)-form, and in fact the direction of steepest descent corresponds to 
the eigendirection for the smallest eigenvalue of V7 hr.) If we thus transport this 
(p + 1)-form parallely along [' and contract it with the tangent direction of I, we 
obtain a p-form at g. Comparing the resulting orientation at g with the one coming 
from the p-form corresponding to gq then determines whether ny is +1 or —1, Le., 
np = | if they agree, nr = —1 else. 

The important point is that 


67 =0. 


This can be verifie directly or deduced from representing 6 as the limit of ds for 
s — oo. (Note that in any case d; : V» > Vp+1 also yields a coboundary operator, 
ie., d? = 0.) It is a standard result of algebraic topology that once one has such 


176 2 Physics 


a coboundary operator, one obtains the strong Morse inequalities encoded in the 
formula 


Yi mpt? — ) byt? = +000), 
P P 


where Q(t) is a polynomial with nonnegative integer coefficients 

A more general version of the supersymmetry algebra arises if, in addition to the 
Hamiltonian H, we also have a momentum operator P, and if the supersymmetry 
operators Q1, Q> satisfy 


OQj=H+P, O3=H-P, Qi02+ O20: =0. 
These relations imply 
[O;,H]=0=[Q;,P] fori=1,2. 
Also, 


H= 50+ 0} 
is again positive semidefinite 

A realization of this supersymmetry algebra arises as follows. 

Let X be a Killing fiel on our compact Riemannian manifold N, 1.e., an infini 
tesimal isometry of N. Let Ly be the Lie derivative in the direction of X, and i(X) 
the interior multiplication with X of a differential form. For s € R, we consider 


dy =d+si(X). 
Let d¥ be the adjoint of d;. Since X is a Killing field one computes that 
dy? = —d?. 
Also 
dd a2! 
5s = —SLy. 
The Hamiltonian is 
Ay = d,d; ae dds : 
The supersymmetry operators are 
1 1 1 1 
O15 =i2d,+i 2d%, Q2,5 =i 2ds +i2d>. 
Definin 
P=2isLx, 


we then have the above supersymmetry algebra, 


Qj=H+P, Q5=H-P, 0102+ 0201=0. 
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More generally, one may use a function / invariant under the action of X, i.e., 
i(X)dh =0, 
and put 
ds\.s9 = e M2, elts2, 
Thus, the parameter s; corresponds to the Killing fiel X, whereas sz corresponds 


to the Morse function h. The supersymmetry generators are then 


1 1 
was ay ae 
O1.s1,59 = i172 ds, 55 +i 2d 


81,52? 
iol 1 oy 
22.51.52 = Zds1.9 +1 th 5 
and 


* 
+5) 54s1,595 


As,,5) = ds,,594, 


* 
51,82 


P=2is\Ly. 


We return to our supersymmetric nonlinear sigma model with a cylindrical world 
sheet R x S, where the space S is a circle of circumference L. Instead of considering 
maps 


@:RxSON, 


we may equivalently consider maps 


w:R- Q,;(N), 


where {2;(V) is the loop space of maps from S to N. The loop space Q;(N) will 
now play the role of our target manifold. Of course, in contrast to what was assumed 
for our target manifold N, Q;(N) is not compact. 

The group U(1) of rotations of S acts on Q;(N) by isometries, simply by map- 
ping a loop y(t) to the loop y(t + a) (the addition in S is the one in R mod L). As 
before, we may defin the operators 


d;=d+si(X), H, =ad,d; +did,, 


where X is the generator of the U(1) action. 

Of course, due to the fact that Q, (4) is infinite-dimensional certain problems of 
convergence arise when trying to carry over the preceding finite-dimensiona analy- 
Sis. 

The approach to Morse theory via the supersymmetric sigma model is due to 
Witten [104, 105]. This in turn led to Floer’s approach to Morse theory that con- 
structs the Morse complex from counting fl w lines between critical points, see [37] 
and the expositions in [65, 96]. 

The supersymmetric action functional (2.4.112) can also be utilized for a proof of 
the Atiyah—Singer index theorem [7], as discovered by Alvarez-Gaumé [3], Friedan 
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and Windey [41, 42] and Getzler [47, 48]. Systematic expositions can be found 
in [10] and [43]. 


2.4.7 The Gravitino 


The preceding considerations were local insofar as the supersymmetry variation pa- 
rameter € was assumed to be constant. It turns out that from a more global perspec- 
tive, e has to be considered to be a section of some bundle and cannot in general be 
taken to be constant. This implies that also derivatives of ¢ will enter the supersym- 
metry computations. We address this issue now and see that it will lead us to very 
interesting geometric structures. 

We start with the linear supersymmetric sigma model from Sect. 2.4.3, that is, 
the extension of (2.4.3) with a supersymmetric partner for the scalar fiel @, an 
anticommuting spinor fiel wy: 


S(o,¥, XL) i= 5 | od", + Wi y% dq Wa) dz. (2.4.135) 


Here, the y~,a = 1,2 are standard Dirac matrices, define by a representation 
of Cl(2,0) as above. (Note: In the physics literature, one usually works with 
a Minkowski world sheet, that is, one takes an indefinit metric on the underlying 
surface, and consequently considers C/(1, 1) instead.) 

The equations of motion, that is, the Euler-Lagrange equations for (2.4.135) are 
simple linear equations ((2.4.58), (2.4.59)): 


dub" = 0, (2.4.136) 
yay? =0 fora=l,...,d, (2.4.137) 


that is, @ is harmonic and w solves the Dirac equation. 
Similarly, one can consider a metric g instead of only a conformal structure and 
consider the functional 


SO, g) = 5 [Pio opt, + Woy dy Wa)Vdetgdz'dz?. —(2.4.138) 


One then has the supersymmetry transformations (2.4.65): 
6¢° =e", (2.4.139) 
by" = yy duh (2.4.140) 


with an anticommuting ¢. (Of course, mathematically, one should consider this as 
a transformation of the independent variables of an underlying superspace instead 
of as a transformation of the fields. The commutator of two such transformations 
yields a spatial translation: 


[51, 62] = 81 (En ¥*) — 82(e1 We") = 261" E20g". (2.4.141) 
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In fact, these are infinitesima transformations that integrate to local ones, but we 
also need to consider the global situation. Globally, instead of a translation, we have 
a diffeomorphism, and so the supersymmetry transformations should generate the 
superdiffeomorphism group of the underlying supersurface. Also, globally, ¢ is not 
a scalar parameter, but transforms as a spin-1/2 field that is, mathematically, a not 
necessarily holomorphic, anticommuting section of K'/?, K being the canonical 
bundle of © (for some choice of a square root of K, that is, of a spin structure). 
(Even though, w.r. its z-dependence, ¢ transforms as a section of K!/?, it also 
contains an independent odd parameter; therefore, ey = —we, but in general, we 
do not have ew = 0.) 

A supersymmetry transformation induces a variation of S; this is computed as 
(cf. (2.4.56)) 


6S = -2 f Age J (2.4.142) 
with the supercurrent 


Ju = 5” Ya” Opa: (2.4.143) 


Likewise, for a spatial translation, we get the energy-momentum tensor: 


1 - 1 - 
Tap — dv" Opba + Vi Yadp Va + a¥° vp0aVa — trace. (2.4.144) 


Of course, this is the appropriate generalization of (2.4.9). As before, it is traceless, 
and again, this can be seen as expressing a (super)conformal invariance. Also, as 
before, both the supercurrent J and the energy-momentum tensor T are divergence- 
free when the equations of motion hold. With the same implicit identification as in 
Sect. 2.4, T is a holomorphic quadratic differential on X, that is, a holomorphic 
section of K?, while J is a holomorphic section of K>/*. 

The preceding facts have several important consequences: 


e In line with the general concept of supergeometry, the space of independent vari- 
ables for the @ and w field should be a superspace, that is, here it should be 
a super Riemann surface (SRS). Then, in the same manner that the Dirichlet in- 
tegral, the action functional D(@, X), yielded a (co)tangent vector to the moduli 
space M, when varying &, now variations of © for S(@, w, X) should yield a 
(co)tangent vector to the moduli space of super Riemann surfaces. From this, we 
infer that the tangent space to that space should be given by even holomorphic 
sections of K? and odd holomorphic sections of K3/. In particular, the even 
dimension should be 3p — 3 as before while the odd one is 2p — 2, again by 
Riemann—Roch. 

e As before, our action functional is only invariant on-shell, that is, when J is holo- 
morphic. From (2.4.142), we see the obstruction to global invariance, namely the 
nonvanishing of d,é. As a spin-1/2 field ¢ is a section of a nontrivial bundle and 
therefore cannot be taken to be globally constant. Thus, the obstruction to full su- 
perdiffeomorphism invariance comes from the global topology of the underlying 
surface. 
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In order to understand these issues better, we now make the fundamental observation 
that the functional S from (2.4.135) or (2.4.138) does not yet constitute a full su- 
persymmetric generalization of the functional S(@, g) studied in Sect. 2.4. Namely, 
we have only given @ a supersymmetric partner, but not our other field namely the 
metric g. We shall do that now and see that this yields a fully satisfactory theory that 
gives a profound understanding of the moduli space of super Riemann surfaces. 

In place of the metric (gyg), it is convenient to consider a zweibein e@, from 
which we can reconstruct the metric as gag = duped eb. In other words, we intro- 
duce an additional U(1) symmetry which, however, can be easily divided out since 
that group is compact. The supersymmetric partner of the zweibein is then a grav- 
itino (Rarita-Schwinger field x4, where A = 1,2 is a spinor index that will be 
suppressed in the sequel, whereas a is a vector index as before. Thus, x transforms 
as a spin-3/2 field This might already suggest how to obtain the moduli space of 
super Riemann surfaces in analogy to 4 of Sect. 1.4.2. Namely, one would take the 
space of all metrics (equivalently, after dividing out the U(1) symmetry, zweibeins) 
and gravitinos, and then divide out all the invariances, that is, the superdiffeomor- 
phisms and superconformal scalings. However, although this idea is conceptually 
insightful, the actual construction of the moduli space of super Riemann surfaces 
proceeds differently, see [93].!° In fact, because the spaces involved, like the one of 
superdiffeomorphisms, are necessarily infinite-dimensional Sachse had to replace 
the standard approach of ringed topological spaces by a categorical reformulation 
of supergeometry, see [94]. 

The supersymmetry transformations of the field $, w, e, x are then 


8Xa = dé, (2.4.145) 
beg = —28y* Xa, (2.4.146) 
bp = ey", (2.4.147) 
by4 = E(u" — W" Xa): (2.4.148) 


The supersymmetric functional is then 
S(O, W,8, Xx) 


1 - - 
= 2 / (e*a.0°o, + yy da Wa + aye yw dpha 
S 


io 
+ Vo" kav 7" xp ) Vdetedel dc, (2.4.149) 


'Using the zweibeins directly would mean taking the phase space of a 2D supergravity theory 
as the gauge theory for supersymmetry. One would then in addition need a super connection on 
x whose coeff cients are the gauge fields Dividing out the invariances involved becomes very 
complicated, and therefore, it is better to proceed as in [93]. 
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Here, the firs two terms are those from (2.4.138), the third one is introduced to com- 
pensate (2.4.142), and the last one is then needed to compensate the terms coming 
from the variation of dg¢@ in the third one. 

We have thus obtained a functional that is fully supersymmetric even off-shell. 


Summary: We see the merging of a profound mathematical concept, namely that 
of a moduli space of Riemann surfaces and a deep method from theoretical physics, 
namely the symmetries of action functionals. This suggests a unique concept of 
a super Riemann surface, for which we have already described the super moduli 
space. It remains to be seen how the approaches of Sect. 1.4.2 extend to this setting. 
Ideally, they should as beautifully coincide as in the situation of ordinary Riemann 
surfaces. 


Of course, the preceding formalism can be recast into the mathematical frame- 
work of supergeometry. 

We have considered only one of the two supersymmetries arising in string theory, 
namely world-sheet supersymmetry, but not space-time supersymmetry. The latter 
refers to the target space, which we have taken to be Euclidean space here. For ex- 
ample, while w transforms as a spinor on the domain, it transforms as a vector in the 
target space. For a discussion of space-time supersymmetry, see, e.g., [50]. Here, 
instead, we replace the Euclidean target space by a Riemannian manifold N. Equa- 
tion (2.4.138) then becomes the supersymmetric nonlinear sigma model of quantum 
fiel theory as treated in the preceding section. The equations for ¢ and y then 
become nonlinear and coupled, and in fact, w is a spinor-valued section of d*T N, 
the pull-back of the tangent bundle of N under the map ¢. Naturally, one can also 
include the field g and x into these considerations, by expanding not only with 
respect to the map into N, but also with respect to the domain metric. 

The supersymmetric action functional with gravitino term is discussed in [26, 
50], with more details in [27]. The moduli space of super Riemann surfaces has been 
constructed from the global analysis perspective advocated here by Sachse [93]. 
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We can now bring the material of the preceding sections together and discuss general 
(Gaussian) functional integrals. These are formal integrals of the form 


[ e 5) (2.5.1) 
where S(¢g) is some quadratic Lagrangian action as introduced in Sect. 2.2 and we 


formally integrate w.r.t. to some collection of field g. We can, of course, also intro- 
duce Planck’s constant and replace (2.5.1) by 


[ eH SO), (2.5.2) 
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When we consider the heuristic limit h — 0, we see that the minimizers of the 

action S dominate the functional integral more and more, because other field 

yield exponentially smaller contributions. For physicists, it is then natural to perform 

an expansion of (2.5.2) in terms of i the so-called stationary phase approximation. 
Certainly, one can also consider the oscillatory integral 


[ en 5), (2.5.3) 


which we may view as a generalization of the Feynman path integral discussed in 
Sect. 2.1.3. 

As before, see Sects. 2.1.2, 2.1.3, we consider Gaussian functional integrals as 
formal analogs of Gaussian integrals with infinitel many variables. In addition, we 
shall make use of the invariance considerations in Sect. 2.3.2 to divide out symme- 
tries. 

There is one general issue that can be contemplated at this point: It is a general 
principle of quantum fiel theory that no arbitrary choices are permitted. When- 
ever something is selected from some class of possibilities, one should integrate 
out the possible values of the selection, weighted with some (negative or imagi- 
nary) exponential of the underlying action. Thus, we consider (2.5.1) when we have 
a collection of field gy. After normalization, we consider 7 Do e°Y) (where the 
constant Z has been chosen so that the total integral of the measure becomes 1) 
as a probability measure on the space of field (similar to a Gibbs measure in sta- 
tistical mechanics). For any function f(g) of the fiel gy, we can then compute its 
expectation value as 


5 i Dg fe), (2.5.4) 


In mathematics, instead of taking a functional integral, in the situation where some 
underlying structure has to be selected, one attempts to equip the space of all pos- 
sible choices with some geometric structure. That is then called a moduli space. 
Above, we have discussed the moduli space of Riemann surfaces. 


2.5.1 Normal Ordering and Operator Product Expansions 


The following example will bring out the essential aspects. Let (MV, g) be a compact 
Riemannian manifold of dimension d. For a function g on M, we put 


1 


Ara’ 


Sg) = [aver + m2?) dVolg(M) 


1 
= aa (-—Agt+ m?)@) 12, (2.5.5) 


where a’ is a constant, the so-called Regge slope. Thus, this is essentially the same 
functional as the one considered in Sect. 2.4, see (2.4.3), with the difference that here 
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we have an additional mass term and a different normalization factor in front of the 
integral. A = Ag is the Laplace—Beltrami operator of (M, g), define in (1.1.103), 
(2.4.4). 

We note some differences here compared to Sect. 2.1.3. There, we had taken 
functional integrals for paths x (in some Euclidean or Minkowski space), that 
is, mappings x : [t’, t’”] > R%, say, with fi ed boundary conditions x(t’) = x’, 
x(t”) =x”. Here, we are integrating functions over a more general domain, namely 
a Riemannian manifold, and we do not impose boundary conditions. In fact, M may 
be some closed manifold without boundary. If M does have a boundary, we can also 
impose a boundary condition via an insertion into our functional integral. 

According to the general scheme just discussed, the choice of the manifold 
(M, g) represents an arbitrary choice, and therefore, one should integrate out all 
such choices, that is, take another functional integral w.r.t. all possible metrics g on 
M, and perhaps also a sum w.r.t. all diffeomorphism types of M. That is, in fact, 
done in string theory, where the dimension of M is fi ed to be 2 and one then for- 
mally integrates w.r.t. all metrics and sums with respect to the different genera of 
the underlying surface. 

We also consider the propagator of the free fiel of mass m, or, in mathematical 
terminology, the Green operator 


G=2na'(-A+m?)'. (2.5.6) 
Thus, 


1 
S~)=5, G7'9);2. (2.5.7) 


The fundamental object of interest is the partition function (in older texts, this is 
denoted by the German term Zustandssumme) 


Li= / Dg exp(—S(g)) 


1 
= / Doexn(—50 o'6)) (2.5.8) 


with a formal integration over all functions g € L?(M). 

The analogy with the above discussion of Gaussian integrals (2.1.24), obtained 
by replacing the coordinate index i in (2.1.24) by the point z in our manifold M, 
would suggest 


Z =(detG)?, (2.5.9) 
when we normalize 
dQ 
D =| | 2.5.10 
, : Jn ( ) 


to get rid of the factor (27r)” in (2.1.24). Here, the (g;) are an orthonormal basis of 
the Hilbert space L?(M), for example, the eigenfunctions of A. 
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The idea is then to defin detG as the renormalized product of the eigenvalues 
An of G. The mathematical construction is based on the Weyl] estimates. By these 


estimates, the 4,, behave as O(n-2). Motivated by (2.1.26), one then puts 
det G := exp(—¢6(0)), (2.5.11) 


where the ¢-function G(s) is the meromorphic continuation of >*,,A;*, define 
for Re(s) < —{, to the entire complex plane; it is analytic at 0. This procedure is 
called zeta function regularization. The determinant define by (2.5.11) has certain 
multiplicative properties like the ordinary determinant, see e.g. [111]. 

Comparing (2.5.8) with (2.1.24), the analogy is then that the coordinate values 
x!,...,x% get replaced by the values of the function g at the points y € M. That is, 
we have infinitel many degrees of freedom, corresponding to the points y € M in- 
stead of to the discrete indices i = 1,...,. The values of these degrees of freedom 
are then assembled into the function g in place of the vector x = (x!,...,x”). 

In analogy with (2.1.30), for points y,,..., ¥m € M, we may then defin correla- 
tion functions 


1 
(901) ++ Gm) = 7 | Pee)---e0%m) exp(—Ste)) (2.5.12) 


Note that, in contrast to Sect. 2.1.3, here we are normalizing the integrals by dividing 
by Z, so that these correlation functions can be interpreted as the expectation values 
of the product of the evaluations of the field at the points y,,..., yj. Again, these 
vanish for odd m (because a Gaussian integral is quadratic in the fields hence even), 
and as in (2.1.31) 


(gO) ¢02)) = G01, y2). (2.5.13) 


Here, the Green function G(y1, y2) is the kernel of the operator G, and it has a sin- 
gularity at yj = yo, of order log dist(y,, y2) for d = 2 and dist(y1, y2)*4 for d > 2. 
Likewise, the analog of Wick’s theorem (2.1.32) holds. 
We now specialize to the case where the particle is massless, i.e., m = 0 in (2.5.5), 
and M is a Riemann surface &. Thus, in complex coordinates, the action is 


s= 


: [ @woeie. (2.5.14) 
27a’ 
We note that the metric g here disappears from the picture. This comes from the fact 
that S in (2.5.14) is conformally invariant, that is, remains unchanged when the un- 
derlying metric is multiplied by some positive function, and therefore depends only 
on the conformal structure, that is, on the Riemann surface on which it is defined 
but not on a particular choice of a conformal metric on that Riemann surface. The 
issue of conformal invariance plays a fundamental role in conformal fiel theory 
and string theory, see [26, 46, 62]. 

The classical equation of motion is (2.4.21), 


ddy(z, Z) =0. (2.5.15) 
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One writes the argument here as (z, Zz) instead of simply z, because the notation 
f(Z) is reserved for a holomorphic function, as explained in Sect. 1.1.2. (2.5.15) 
implies that g is a holomorphic function dy(z), and dg is an antiholomorphic 
function dg(Z). 

The complex coordinates 


z=x! +ix?, 


NI 
> 


admit a Minkowski continuation with x° = —ix*. Then, a holomorphic function is 
a function of x° — x!, an antiholomorphic one is a function of x° + x!. One calls an 
(anti)holomorphic function left-(right-)moving. 

As before, we wish to compute the expectation values 


1 
(FW) =5 / Dy exp(—S)F(g). (2.5.16) 


We shall now repeat some of the discussion of Sect. 2.1.3 and see how it applies 
to the present situation. The above analogy between ordinary integrals and path 
integrals said that the finitel many ordinary degrees of freedom, the coordinate 
values of the integration variable, are replaced by the infinitel many function values 
y(z, Z). Therefore, integration by parts should yield that 


r) 
0= | De exp(—S). (2.5.17) 
This gives 
0=-| > exp(—S) ——— 28 
Pee 8G) 
and so, 
6S 1. - Z 
0= ( - )= (d0@(z, Z)). (2.5.18) 
dp(z,z)} ma’ 


Thus, the classical equation of motion (2.5.15) becomes an equation for the expec- 
tation value of the corresponding operator. Equation (2.5.18) can also be written as 


—a.8:(0(6,9) =0. (2.5.19) 


Let us return to (2.5.16). The functional F(gy) typically represents certain linear 
combinations of products of evaluations of g at points z1,..., Zn € X&. When none 
of those points coincides with the point z for which we take the functional derivative 
ae) ae the preceding computation also goes through for F'(@). 

Things change when one of those insertion points is allowed to coincide with z. 
In Sect. 2.1.3, that led us to the temporal ordering scheme for operators. Similarly, 
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here, from the analog of (2.1.125), we shall be led to the so-called normal ordering 
scheme. 
For example 


5 7 
o= i: Doss ex(-S9(6. 5) 


- 1 . 
= [ Dee-s)(e b.2 £) + = (cdevle, 0.8). (2.5.20) 


Thus 


- 1 - 
0= (ac 6,2 B) + —ade0(e, Ue. )} (2.5.21) 


Again, this is not affected by other insertions not coincident with z. 

We thus interpret (2.5.18) and (2.5.21) as operator equations, that is, as holding 
for all components of the corresponding quantum mechanical operators, since these 
are precisely obtained by such insertions. 

We thus write the operator equation 


1 - Z 
—, 9,029(z, 29 (S, 6) =—8Z—-6,Z—-$), (2.5.22) 
ITO 


as in (2.1.125). When we solve (2.5.22), we therefore obtain a Green function type 
singularity, log |z — ¢|°. 

In order to eliminate this contribution, one considers the normal ordered opera- 
tors 


:9(Z, Z): = (Z, 2), 
o! : (2.5.23) 
:9(Z1, 21) 9(Z2, 22) : = O(Z1, 21) (Z2, Z2) + 5 leglzi — 2°. 


This quantum correction will below lead to a central extension of the Lie algebra of 
the diffeomorphism group of the circle (see (2.5.63), (2.5.66) in Sect. 2.5.3). 
We then have 


91 91:9 (z1, 21) 9(Z2, 22): =0. (2.5.24) 


Thus, :9(z1, Z1)@(z2, Z2): is a harmonic function and therefore locally the sum of 
a holomorphic and an antiholomorphic function. From this, we obtain the Taylor 
expansion 


(21, Z1)@(Z2, 22) 
a’ 

= ——log|z) — 221 
5 g1Z1 — 22 


CO 1 _ 
+ > mr (Ca — z2)":98" p(Z2, Z2): + 1 — Z2)":99" Y(Z2, Z2):) (2.5.25) 
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(mixed terms with 34 vanish by the equation of motion; note that in general, deriv- 
atives need not commute with normal ordering). 

Equation (2.5.25) is the prototype of an operator product expansion (OPE). As 
discussed, the g’s here are considered as quantum mechanical operators. 

The transition from functions to operators needs some explanation. In (2.5.16), 
we can add insertions /!,..., 7’, that is, functions of gy evaluated at some points 
Z1,--+;%m € M. Thus, we have expressions of the form 


1 
’ / Dg exp(—S)F (g)I' (9) (21, 21) ++") Zn Zm)- 


More generally, we can also have insertions of the form 


fre Zz; 9) du(z) 


for some measure djz(z). For example, these insertions can be certain boundary con- 
ditions represented by Dirac functionals. When we do not specify these insertions, 
we simply write 


(F(Q)--+). 


F (g) then determines an operator F (g) operating on such insertions. (F'(g)) is the 
matrix element (O|F (g)|0) of F (gy) where |0) is the vacuum. 


2.5.2 Noether’s Theorem and Ward Identities 


Before proceeding, we need to translate Noether’s theorem into the operator setting. 
The result is a Ward identity. 
As in Sect. 2.3.2, we consider a general Lagrangian action 


s= | Fw(w,deeas (2.5.26) 


and transformations 


/ 
XxX’, 


p(x) g(x’) = W(e(x)). 
Infinitesimall , 


x’ =x-+s6x, (2.5.27) 
g(x’) = o(x) + sdW(x). (2.5.28) 
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By (2.3.28), the Noether current is 


y 5xP 5 
jt= (-F poe de igF) ciadierae eas (2.5.29) 
x i i 


Z / dxdy j%s;. (2.5.30) 
According to Noether (2.3.29), invariance implies a conserved current: 
do jf’ = 0. (2.5.31) 


We now turn to the quantum version, that is, invariance of correlation functions, 
when action and functional integral measure both are invariant: 


(POX) POH)) = (WG) W@On))) (2.5.32) 


by renaming variables (yg +> g’) and transforming Dg’ to Dg. 
Ward identities express symmetries in QFT as identities between correlation 
functions. According to (2.3.27), the fiel variations are given by 


bw dx dag 
Go:= — - ——. 2.5.33 
? és Os Ox ( ) 


For a collection ® = g(x1)--- g(x,y) of fields we have by invariance 


; / De® exp(—S(g)) 


= () 
1 
=5 i Do'(® + 5) exp(- (sw + / dxdq 52S; «»)). 
If the measure is invariant, i.ec., Dy’ = Dg, then by differentiating w.r.t. s gives 


(50) = / dx da (jx) ®)54(x) (2.5.34) 


(note that ® does not depend explicitly on x, and thus 0(j(x))® = 0(j(x)®)). 
Since 


== Vea ++ GQ(Xk) ++ P(X) 5 (Xk) 
k=1 


= f 4x50) PO) GG) ++ WC) (x = 8). 
k=1 
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we obtain the Ward identity for the current /: 


n 


0 
g,0 I er) +++ @(Xn)) = Yip) ++ GP(XK) +++ PAn))d% — XK). (2.5.35) 
ve k=1 


0 x9 


We now assume that the time t = xy is different from all the times x,,..., x, 
occurring in ®. We integrate (2.5.35) between t — ¢ and t + e for small ¢ > 0 to 


obtain 
(Q(t + €)px1)®) — (Qt — €)9@1)®) = (Ge) ®) (2.5.36) 


for the charge Q (define as in (2.3.40)). When we time order the operators, we 
need to exchange Q(t — €) and g(x), because t-—¢ <t= re Since (2.5.36) holds 
for any such ®, we obtain 


[O, p] = Gg. (2.5.37) 


Thus, the conserved change Q is the infinitesima generator of the symmetry trans- 
formations in the operator formalism. 

If instead of a Minkowski space-time, we consider Euclidean space, the time 
ordering is replaced by a radial ordering of the operators as will be discussed in 
Sect. 2.5.3 below. 


2.5.3 Two-dimensional Field Theory 


We now compare the preceding with 2-dimensional fiel theory. We have a spatial 
coordinate w! that may be bounded or periodic, 


wi ~ w! +27, (2.5.38) 
and a Euclidean time coordinate t = w?, 
—00 <w? <0. (2.5.39) 
We put 
w=w!+iw (equal time coordinates are horizontal lines) (2.5.40) 
and 
z=e'” (equal time coodinates are concentric circles C about 
origin z = 0 which corresponds to the infinit past w = —00). 
(2.5.41) 


When going from the Minkowski coordinates w!, w* to the complex coordinates 
z, temporal invariance w? > w* +t then becomes radial invariance z > Az with 
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2X €R. This is the starting point of conformal invariance and constitutes one moti- 
vation for conformal fiel theory below. 

In the z-coordinates, the charges Q (2.3.40) then become contour integrals of 
currents j 

dz , 
O{C}= oe (2.5.42) 
Cc 471 

Here, we assume that the current j is meromorphic, without poles on the contour C, 
of course. 

We now consider 


Q1{C1} Q2{C2} — Q2{C2} Q1{C3}. (2.5.43) 


This corresponds to a time ordering Tt) > T2 > 73. 


C2! Ch 


Therefore, when we time order the operators 0; corresponding to the Q;, we 
obtain the expression 


0102 — 0201 =[Q1, Onl. (2.5.44) 


We now consider a point z2 € C2, and we can deform the contours as follows: 


When we consider infinitesima time differences, tT) = tT2 + €, T3 = T2—€,€ > 0, 
we contract the contour C,; — C3 to Co, that is, the small circle about z2, to the 
point zo. 

We obtain from (2.5.42)—(2.5.44), leaving out the * for the operators as usual, 


d 
[01, QC} = 222 Reg. 5 zy Ji(21) Jo(22). (2.5.45) 


C) 20i 
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This is a fundamental relation. On the |.h.s., we have the commutator algebra of the 
charges, while on the r.h.s., the singular terms in the operator product expansions 
(OPEs) of the currents appear. 

Instead of the conserved charge Q2{C2}, we can also take an operator A(z, z) to 
obtain 


LQ, A(z2, 22)] = Resz, +2, j (21) A(Z2, 22). (2.5.46) 


When Q is the conserved charge for a variation 6, as in Sect. 2.3.2, 


p(x) > g(x) + ies(x), (2.5.47) 
we have, by (2.5.37), 
1 
[O, A(z, z)] = = ae Zz (2.5.48) 
(2.5.46) and (2.5.48) yield 
c 1 _ 
Resz,-+2) J (Z1)A(Z2, 22) = ~ 7, 5A, Z2). (2.5.49) 


We now consider a conserved current j in a two-dimensional fiel theory. As 
a conserved current, by (2.3.22) and (2.3.29), it is divergence free, that is 


Oz jz + 0; jz = 0. (2.5.50) 


Taking as a model the energy-momentum tensor T in Sect. 2.4, we now assume that 
we have 


jz = OZ jzg + OZ jez, ig = 82 jeg + 82 fz (2.5.51) 


for some holomorphic variation 6z. Equation (2.5.50) then becomes 


03 jizz a Oz Jez =0, 


(2.5.52) 
Oz jez + Oz jez = 0. 
We also assume that the tensor (jz,, .. .) is symmetric: 
Jez = Sez» (2.5.53) 
and (noting that tr j = g7? jpg = 9° jz, + 9% j-z) trace-free: 
jz =09, (2.5.54) 
which it has to be for the theory to be conformally invariant. 
These relations imply that it is holomorphic: 
dz jzz = 0, 
(2.5.55) 


0z zz = 9. 


192 2 Physics 
We put 
J) = Jez), 
i(@) = jz@). 
This implies that f(z)j(z) is conserved as well: 
éz(fj) =0, (2.5.56) 


for any holomorphic function f(z). 

Thus, we obtain infinitel many conserved currents. In two-dimensional fiel 
theory, this corresponds to the fact that the local conformal group is infinite 
dimensional, as conformal invariance led to the energy-momentum tensor T as our 
conserved current j in Sect. 2.4. 

For each holomorphic f, we therefore obtain a conserved charge 


Qr= f “ fore (2.5.57) 
which generates the conformal transformation 
zee ztef(z). (2.5.58) 
According to (1.1.89), the induced transformation of a fiel (z, Z) is 
PZ,Z) > OZ, 2) +5 ¢ FOZ, Z) (2.5.59) 
with 
5 ¢ FP, 2) = (hd. f +hdzf + fa. + fa) 9,D (2.5.60) 


where h and hare the conformal weights of . 
We consider an (h,0)-form (z, Z)(dz)". Equations (2.5.48), (2.5.49) and 
(2.5.57) give 


—1O Fr, e()] 
—Resz, 32, f (ZT (z1) 9(2) 


5 ¢9(Z) 


d 
7 § = Fer evo, (2.5.61) 
Cc 471 


where C is now a small circle about z. 
Since h = 0 here, we obtain from (2.5.60) and (2.5.61) that 


hg(z) 029 (Z) 


T(zi)g(z) = 5 + finit terms. (2.5.62) 
Gi) 23 


In particular, for an (A, 0)-form, the conformal weight h can be recovered from the 
operator product with the energy-momentum tensor. 
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If we take, instead of gy, the energy-momentum tensor T itself in this OPE, 
we obtain an additional term that essentially comes from the fact that T involves 
a square of derivatives of field which induce additional commutator terms: 


c 2T (z) d-T (Z) 


T(z)T(z)= + finit terms. 2.5.63 
2(zi—z)* (i -zy az 7 


Here, c is some constant, the so-called central charge. Since T is holomorphic, we 
can Laurent-expand it: 


[ee 


Lin 
TQ) De ao (2.5.64) 
m>=—OoO 
that is, 
dz m+1 
i= ere T(z) (2.5.65) 
0 


for a circle Cp about the origin z = 0. 
The L,, are the generators of the Virasoro algebra 


(ene inf dz dz el c 4 2T (z) + 4 
ns 4m) = ae a 
Cy 2mi Sc, 2ni! 2zi-z)* (iz uz 


Cc 
= pe —D)@t1)bmtn + (2 -—m)Lintn- (2.5.66) 
To obtain this, one uses 
zt} = (zy —z) +2)" 
3 n 


6 
+ (nt (qp—zer +e" $e. 


n2>+n 


(z1 J zz"? + a 


(z1-2z 


Summary: The generators of the Virasoro algebra are the Laurent coefficient 
of the energy-momentum tensor 7. The expansion comes from the holomorphicity 
of T, which in turn follows from the invariance properties of CFT. Since, in contrast 
to the classical action, the quantum expectation values are not conformally invariant, 
we obtain a central charge c 4 0 in the commutators of the L,,. 


Lo, L; and L_, generate an algebra isomorphic to sI(2, R), the Lie algebra of 
S1(2, R). That Lie algebra is represented here by infinitesima transformations of 
the form a + Bz + yer = 6z, the infinitesima version ata =d=1,b=c=0 
of ze aut the operation of S/(2, IR). In fact, for n,m = —1,1,0, (2.5.66) is 
the same as (1.3.48), except for the different notation, of course. In general, L, 
generates 6z = 7”*!. L, acts on a primary fiel (primary can be define by this 


relation) as 


[Ln, p(z)] = 2" (20, + (n+ DA) gC). (2.5.67) 
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There is one point here that needs clarification the relationship between the clas- 
sical energy-momentum tensor as define in Sect. 2.4, see (2.4.8), and its operator 
version. According to (2.4.8), the energy-momentum tensor is the Noether current 

associated with a variation of the inverse metric y°?: 
6S= / dxTypdy. (2.5.68) 


Quantum mechanically, we have 


Ly +5y = (DQ) y+5y exp(—S(g, yr éy)) 


= / (De),(1+ / dsT5y~) exp(-S(0. 7). 


assuming that the energy-momentum tensor incorporates both the variation of the 
action and of the measure, 


=Z,+ 2, f dxby"\(T),, 


Thus, 
1 = 
zit, = | axsy "Ts (2.5.69) 
y 
or, putting in a factor of 47 for purposes of normalization, 
: Zy = * (ri )) (2.5.70) 
Zy yy) age e 
and more generally, 
Gr 2 (ola) +: 96n))) 
@p xX oe ‘@ X. 
Zy by~"(y1) bY 1m)” i 
= (T(y1) +++ T Om) PI) +++ PAn))- (2.5.71) 


The variation of the measure then induces the central charge c in the expansion 
(2.5.63) of the operator version of the energy-momentum tensor. 


2.6 Conformal Field Theory 


2.6.1 Axioms and the Energy-Momentum Tensor 


Conformal fiel theory was introduced by several people. An early paper that was 
important for the subsequent development of the theory is [9]. A monograph devoted 
to this topic is [38]. We shall also utilize the treatments in [77] and [46]. 
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In the preceding, we have derived certain formal consequences of the functional 
integral (2.5.5). In particular, the partition function and the correlation functions 
satisfy certain relations, and from those, we have obtained the energy-momentum 
tensor. Its classical version could be identifie with a holomorphic quadratic dif- 
ferential in Sect. 2.4. One problem, however, was the definitio of the functional 
integral (2.5.5). There, we briefl discussed the mathematical definitio in terms 
of zeta functions, see (2.5.11), and the spectrum of the Laplace—Beltrami operator. 
One way to circumvent that problem is to take the indicated algebraic relations and 
holomorphicity properties as the starting point for an axiomatic theory. This is the 
idea of conformal fiel theory. 

Thus, abstract conformal fiel theory specifie for each Riemann surface © with 
a metric g a partition function Z, and correlation functions (~1 (x1) +++ @n(%n)) for 
the primary field with non-coincident x1, ...,x,. These basic data do not need any 
action or functional integral—although (2.5.5) remains a prime example. The theory 
is define in terms of symmetry properties of these correlation functions. 

Essentially, these are: 


(1) Diffeomorphism covariance: for a diffeomorphism k : 2 > &, 


Lg = Zep, (2.6.1) 
(91(K@1)) +++ Pn Kn) g = (G11) ++ Pn On) Kg: (2.6.2) 

(ii) Local conformal covariance 
Zoos = exp (se (Cae +a a(s)R(x)) ) Ze (2.6.3) 


n 
(pi (x1) ++ GnXn))eog =| [exp(—hio (xi) (pi a1) ++ Onn) g. (2.6.4) 
i=1 
Here, R(x) is the scalar curvature of (X, g), and h; is the conformal weight (see 
below) of the fiel g;, as introduced in Sect. 1.1.2; c is called the central charge of 
the theory. (For the conformal fiel theory define by (2.5.5), we have c = 1.) 

In particular, and this is the fundamental point, the quantum mechanical partition 
function is not conformally invariant, but instead transforms with a certain factor 
that depends on the central charge. 

We return to the formula (2.5.9) for the functional (2.5.14) on a Riemann surface 
for m = 0. Since G = 27a’ (—A)7!, we should have, up to a factor, 


det G = (det A)7!. 


Since, however, A has the eigenvalue 0 (Ago = 0 for a constant function go), we 
need to restrict it to the orthogonal complement of the kernel of A, that is, to the L?- 
functions g with J mu ¢dvolg(M) = 0, when definin the determinant by ¢-function 
regularization. The corresponding determinant is denoted by det’. In fact, one should 
also normalize it by the volume (area) of M. 
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Now, however, while the action is conformally invariant, the Laplace operator 
(1.1.103), (2.4.4) 
1 & 
fg 8202 


and therefore also its eigenvalues depend on the metric g and not only on its con- 
formal class. When we consider a variation g(x) eo) g(x) of the metric, we can 


compute 
5 det’(—A) 
log 
50 (x) Vol(M) 
where R(x) is the scalar curvature of g, see e.g. [38], p. 145ff. 
Denoting the partition function for the metric g by Z,, we then have 


1 
oo — 75, FO): (2.6.5) 


6 


c 
Boa oo 8le=0 = = R(x)Zy. (2.6.6) 


24m 
More generally, one define the energy-momentum tensor as an operator by 
(2.5.71), that is, 
(T, ay B (Z1,Z1)°° * Ten Bm (Zn » Zm)P(W1, WI) °° “P(Wns Wn)) 


_ 1 (45 
~ Zy 5gu1Bi (z1, Z1) Aas 5 g%mPm (Zim, Zm) 


(Zg(P(W1, tn) +++ (Wn, tn) 9). 
(2.6.7) 


Thus, as an operator, the energy-momentum tensor takes into account the variation 
of the action S and of the integration measure Dg, as at the end of Sect. 2.5.3. 
In particular 
5 


a Z (2.6.8) 
Z, 3e°F(z)* = 


(Top (Z, Z)) = 

At aconformal metric g = p|dz|’, that is, g°° =2p~*, g** = 0 = g*“, we consider 
the above variation g > e® g and obtain 

An 5 zz 22 22 

ae Zee glo=o = — 8°" (Tez) g — 28° (Tez) g — 8° (Taz) g = —4 07 (Tez) g. (2.6.9) 

From (2.6.8), (2.6.6), we then conclude 
_ c 
4p *(Ta)g = — ZR. (2.6.10) 


Since the Euclidean metric (g** = 2, g** = 0 = g**) has vanishing scalar curvature, 
we have there that 


(Tzz) = 0, (2.6.11) 
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that is, the energy-momentum tensor is traceless when the metric is Euclidean. For 
nonvanishing curvature R, however, T is no longer traceless. The trace given by 
(2.6.10) involves both the curvature and the central charge c. 

From Axiom (ii), we obtain 


= © (a2 — 2 (a,0)° 2.6.12 
= (Tez) — 75 apo = 5 Or8) > ( .O. ) 
using, for the last step, (2.4.6) and the formula 
1/020% 07g ee 
R= + —2— }) + higher-order terms in g**, 9**, (2.6.13) 
2\ az? az? 


which is valid when we vary the Euclidean metric, that is, when we have g** = 2 
(see (1.1.148)). From (2.6.1) and (2.6.12), under a holomorphic transformation 
zt> w= f(z), 


(f’ (z)) (Tww)dwaa = (Tez) ¢1(g)P2dzdz 


a ‘ 
= (Tzz) — ala 5 log f’ (Zz) — 5( 5 toe. ©) ) 
m " 2 
nn -£(G2-Y £2) 
2\ f(z) 2\ f'@) 


Cc 
= (Tz) — pit Eh (2.6.14) 


where { f; z} is the so-called Schwarzian derivative of f. So, we see here an im- 
portant difference between the classical and the quantum energy-momentum ten- 
sor. While the latter is trace-free (2.6.11) for the Euclidean metric (but not in gen- 
eral) and holomorphic (2.6.17) (below) like the former, it no longer transforms as 
a quadratic differential, but instead picks up an additional term in its transformation 
tule (2.6.14). That term depends on the central charge c of the theory. 

In order to take also variations w.r.t. g**, we now reconsider (2.6.9), (2.6.10) as 


; s = c 
= 8 (Tez) 9 — 28°" (Tez) 9 — 8°" (Tez) g = a (2.6.15) 
Next, applying ze 5p ;Zg to (2.6.15) and recalling that the background metric is 


flat that is, g°* = 2, ¢°* = 0 = 9%, as well as (2.6.13), and using (2.1.54), we obtain 


48 (2 — 21)(Tec) + 4m (Taye, Tez) = S-825(2 — 21). (2.6.16) 
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Next, diffeomorphism invariance implies that (7,,) is holomorphic, as in the classi- 
cal case, 


0z(Tzz) = 0 = 0, (Tz). (2.6.17) 
Finally, one has the OPE 
c Ds 
(TzzT 21) = + (Tz, 21) 
ZZ" 2121 Me=- 2)" (c—z) 2121 
1 
—— 0, (Tz,-,) + analytic terms in z. (2.6.18) 
2-21 


We shall now explain this in more detail. 


2.6.2 Operator Product Expansions and the Virasoro Algebra 


We take up the discussion of Sect. 2.5.3. As before in (2.5.58), we consider z 
zt+ef(z), f holomorphic. 

We apply the general Ward identity (2.5.35) for j = fT, T being the energy— 
momentum tensor in CFT, writing T(z) for T;,, 


n 


0 
ag FP @ee) + PZn)) =D ((z1) ++ 8G (Ze) ++ PEn))B(Z— Ze) (2.6.19) 
k=1 


to primary field with variation (see (2.5.60)) 


dy =hd, fy + fazg. (2.6.20) 


1 1 
6(Z—- Zk) = ~3:(——) 


and integrate 0, f5(z — zx) by parts to obtain, using that (2.6.19) holds for all (holo- 
morphic) f, and neglecting the factor zr, 


We also write 


n 


0 h 
gg (T 91)“ En)) Bt 3) tole) -+-#te0)) =0. 


Oz, + 
ie eS (z— 


(2.6.21) 

Under a holomorphic fiel f, T has to transform as 
5 pT (2) = f(2)a.T +28, f)T + ae f, (69>) 
because f transforms like z, and T transforms like (dz). As always, c is the 


central charge. 
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When we want to use y(z,) = T(z) in the preceding, we therefore have to re- 
place (2.6.20) by (2.6.22) and obtain (2.5.63), that is, 


1 2 
(T(2)T (21)) = ——2,z, (T (21)) + —— 5 (T1)) 
ZZ (z— 


2 
21) 


c 
————_ + analytic terms. (2.6.23) 
2(z-2z1)4 


This is (2.6.18). 
We also recall (2.5.57), saying that the transformation z+» z+ f(z) is generated 
by 


- dz T(2) 
Or=$ Sf (z). 


Therefore, in particular, 


[Oy, T(w)] = fdwT + 2(0n f)T + aint. 


As above, the commutator means that 


(LO, T(w)ly(z1)--- @(n)) 
d d 
= ( == -$ =) rarer ween (Zn), 
2 


C, 27 2ri 


where z lies inside C1, but outside of C2, while the zz all lie inside Co. 
Integrating this with some function f2 around the loop C2 then leads to 


d 
[On Qpl=Qtp.m1+ 34 f 5 (@2 A fe — fr82 fy). 


This then gives us the Virasoro algebra (2.5.66). 


2.6.3 Superfields 


We recall the basic transformation rules for a family of super Riemann surfaces from 
Sect. 1.5.3: 


Z= f(z) + Ok(z), 
3 = g(z) + Oh(z), (2.6.24) 


] 
fg, k, h holomorphic, a #0. 
Z 


We defin 
Dy := 9 + 08, Di, = dz. (2.6.25) 
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(D,. had been called t in Sect. 1.5.3, and later on, we shall sometimes write 6 in 
place of 0, and 6_ in place of 0.) 


The transformation law under holomorphic coordinate changes is 


Dy = (D46)D, + (Diz —-6D,6)D,’. (2.6.26) 


Superconformal means homogeneous transformation law, i.e., 
D,Z=6D,46. (2.6.27) 


This is equivalent to 


Z= f(z) + Og(z)h(z), 


~ (2.6.28) 
6 = g(z) + 6h(z) 
with 
a a 
h?(z)= a + g@) (g anticommuting). (2.6.29) 
z z 
Since D+. = 0, (2.6.27) yields 
0.2 + 60,6 = (D6) (2.6.30) 


as a compact version of the superconformal coordinate transformation rule. 


. . 1 . 
In global terms, 0 is a section of K 2, a square root of the canonical bundle of the 
underlying Riemann surface &. Such a square root of K corresponds to the choice 
of a spin structure on &. (To see this transformation behavior, put for example g = 0 


in (2.6.28). Then from (2.6.29), 6 = ,/ $40.) 
We now look at the transformation behavior of conformal (primary) superfield 


X(Z,0) = 9(Z) + OY (Z) 


of conformal weight h. Since 6 as a section of K > has conformal weight 7 this 
means that y has weight h — +, while gy has weight h. According to (2.6.30), we 
can also express the transformation law as 


X(z,0) = XG, 6)(D,6)" (2.6.31) 


(since @ has weight 5 and D has weight -3, D6 has weight 0, which it should, 
to make the transformation law consistent). Similarly, (2.5.60) becomes 


bf 9(Z) = (ha f + FO )P@), 


I (2.6.32) 
bpV(Z)= ((« = 5 af = fac) W. 
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We also have the supersymmetry transformations for an anticommuting holomor- 
phic g, 


1 
be0(z) = 584, (2.6.33) 
1 
beW(Z) = 5889 + hase, (2.6.34) 
that is, 
1 
5¢X(z,0) = (520+ +ha.e)x. (2.6.35) 
As before, we write this as a commutator with a charge 
dz 
8X = [0 X1= $ eT GX) (2.6.36) 
Cc 


for a small circle c about z. 

Here, T is the (anticommuting) generator of the superconformal algebra. From 
this, we can draw the same consequences as above. We observe that for two super- 
symmetry transformations generated by gj, go, if we put 


1 
f= 58182) (2.6.37) 

we have 
[515g], X (z, 0) = 5X (z, 4), (2.6.38) 


where + denotes the anticommutator. Thus, a supersymmetry transformation is 
a square root of a conformal transformation, as it should be according to (2.6.30). 

As in Sect. 2.4.3, with wi = Wy —iv2, W- = W1 ti and 64 = 6, +162, 6. = 
6 — i6, (alternatively, if we wished to conform to the notation in (2.6.24), we could 
write 6,6 in place of 64, 6_), we use the operators Dy = 49, + 643,, D- = _ + 
6_ dz and consider a superfiel 


1 ; 
X= $+ 5 (Wis + WO) + 5 F640 (2.6.39) 
and obtain the action 


1 
S= i 5 D-XD;Xd°xd0_d6 


: 3 a 2\ 72 
= | 5 (40.0026 — W444 - Wt FY dz. (2.6.40) 
2 dz dz 
In Sect. 2.4.3, we derived the equations of motion (2.4.62), 


D_D,X=0. (2.6.41) 
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A solution can be decomposed as 
X(z,64,2,0-) =X(z,0,) + X(Z, @_), (2.6.42) 
and we may write 
X (Zz, 04) = 9(z) + O44 (z). (2.6.43) 


The action is invariant under superconformal transformations and the corresponding 
energy—momentum tensor is 


1 
T= —5D1X0,X = Tr + 0+Ts, (2.6.44) 
with 
1 
te — swag, (2.6.45) 
1 > | 
Tz = — 7 (9) - 5 oe ow. (2.6.46) 


Tp isa section of K?, Tr one of K?, 

We consider a complex Weyl spinor y+ ona Riemann surface ©, that is, a section 
of a spin bundle K 2, a square root of the canonical bundle K, given by a spin 
structure on &. We let w_ be the complex conjugate of w+. Thus, y_ is a section 
of K2 (for the same spin structure). 

We now consider the case where © is a cylinder, with coordinates w = t + io, 
identifying o +27 with o and with t in some interval which is not further specifie 
here. As there are two different spin structures on a cylinder, we have two choices 
for identifying y at o + 27 with y ato: 


Wi(t,0 +27)=wWi(t,c0), periodic(Ramond), or 


(2.6.47) 


Wilt,o0 +27)=—wil(t,o),  antiperiodic (Neveu-Schwarz). 


These boundary conditions also arise from the following consideration. We consider 
the half cylinder where o runs from 0 to z, and we assume boundary relations 
between the holomorphic fiel y+ and the antiholomorphic fiel y_, 


wW+(0,tT)=vw_(0,t) withv=+1, 
(2.6.48) 
W+(a, T) = W_(a, T) 


where the factor +1 has been chosen w.1.o.g. in the second equation. We can then 
combine y+ and psi_ into a single field define for o € [0, 27], by putting 


Wi(o,tT)=w_Qn—-0,t) form <o <2. (2.6.49) 
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w+ then is holomorphic, because y_ was antiholomorphic. Also, 


w+(0,T) forv=1, 


wi(27,t)=p_(0,t)= —w+(0,t) forv=—l. 


(2.6.50) 


Thus, y+ is periodic (Ramond) in the firs and antiperiodic (Neveu—Schwarz) in the 
second case. 
We now map the cylinder to an annulus via 


z=e”, 


Since w+ transforms like (d w)?, we have 


aad (z)(dz)? = yor wy (dw) 4 : 


a w 
— — 7 ay ee 
dw 


When we now rotate the cylinder by 27, the factor er changes by a factor —1. 
Therefore, periodic and antiperiodic identification are exchanged, and on the annu- 
lus, we have 


with 


Ramond: w+ (e277 z) = —wW(z) (antiperiodic), 
Neveu-Schwarz:  wi(e?"'z)=wa(z) (periodic). 


We shall now expand these expressions in terms of 


Z12 = Z1 — 22 — 910, 


We obtain 


bis 1 
T (21,01) X (22, 02) = h— X (22, 02) + —— Dy. 2X (22, 92) 
ap) 2212 


6 
Rae, 0-, X (z2, 62) + regular terms, 
Z12 


cl 3 O12 1 
T(z1, 01) T (22, 02) = = =- + = -T (22, 2) + —— D3 rT (22, 02) 
6 23 2 22 2 
ae 249 Z12 


6 
+ ns 0.,T (z2, 62) + regular terms. 
Z12 
In components: 


c 1 : 
Tr (21) Ta Za) = © (21 — z2)° . (Z1 — 22) 


5 Pp (Z2) + 0,,TB(Z2) +++: , 


Z1 — 22 
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3 1 1 
Tp (21) TF (22) = = ———5 TF (2) + ——— 02, Tr (Z2) +: :- 5 
2 (z} — z2) Z1— 22 


( 


Cc 1 1 
Tr (z1) Tr (z2) = + 
6(z1-z)? 22-2 


Tp(z2)+-:-. 


We expand Tz as before and Tr as 


i ate dei k 
Tiss “Gy (Ge=2 E, me, 
FR) = 5 yz k ( k gx F(Z)z 


keZ+a 


with a = 0 corresponding to the Ramond sector and a = 5 corresponding to the 
Neveu—Schwarz sector. 
With ¢ = ic, we obtain the super Virasoro algebra 


a 
[Lin, LnJ_ = (m — n)Lintn or —(m3 _ M)dm-4ns 
8 


1 
[Lin, Gel_ = (3 = k) Gintks 


c 1 
[Gx, Gale = 2L 41 + 7 (« _ 7 Bk 


2.7 String Theory 


In conformal fiel theory, Sect. 2.6, we have kept the Riemann surface = fi ed 
and varied the metric on & only via diffeomorphisms—which left the partition and 
correlation functions invariant—and by conformal changes—which, in contrast to 
the classical case, had a nontrivial effect, the so-called conformal anomaly. In string 
theory, one also varies the Riemann surface & itself. Equivalently, as explained in 7 
in Sect. 1.4.2, we permit any variation of the metric y, including those that change 
the underlying conformal structure. Here, we can only give some glimpses of the 
theory. Fuller treatments are given in [50, 77, 87, 88] and, closest to the presentation 
here, in [62]. 

In bosonic string theory, one starts with the linear sigma model (Polyakov action) 
(2.4.7) 


S(g, Y) G71) 


and considers the functional integral 


a a e SM dgdy. (2.7.2) 


topological types 
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This means that one wishes to average over all field @ and all compact”? surfaces, 
described by their topological type (their genus) and their metric, with exponential 
weight coming from the Polyakov action. Since, as discussed, that action S(g, y) 
is invariant under diffeomorphisms and conformal changes, that is, possesses an 
infinite-dimensiona invariance group, this functional integral, as it stands, can only 
be infinit itself. Therefore, one divides out these invariances before performing the 
functional integral. As described in Sect. 1.4.2, the remaining degrees of freedom 
are the ones coming from the moduli of the underlying surface, and we are left with 
an integral over the Riemann moduli space for surfaces of given genus and a sum 
over all genera. The essential mathematical content of string theory is then to defin 
that integral in precise mathematical terms and try to evaluate it. The sum needs 
some regularization, that is, one should put in some factor «, depending on the 
genus p that goes to 0 in some appropriate manner as the genus increases. Alterna- 
tively, one should construct a common moduli space that simultaneously includes 
surfaces of all genera. Since lower-genus surfaces occur in the compactificatio of 
the moduli spaces of higher-genus ones, this seems reasonable. As discussed above 
in Sect. 1.4.2, however, the Mumford—Deligne compactificatio is not directly ap- 
propriate for this, as there the lower-genus surfaces that occur in the boundary of 
the moduli space carry marked points in addition. With each reduction of the genus, 
the number of those marked points increases by two. When we then consider sur- 
faces of some fi ed genus po in a boundary stratum of the moduli space of surfaces 
of genus p, we have 2(p — po) marked points, and this number then tends to oo 
for p — oo. Therefore, we need to resort to the Satake—Baily compactificatio de- 
scribed in Sect. 1.4.2 which does not need marked points, but is highly singular. We 
also recall from there that this compactificatio can be mapped into the Satake com- 
pactificatio of the moduli space of principally polarized Abelian varieties. Again, 
the compactificatio of that moduli space for principally polarized Abelian varieties 
of dimension p contains in its boundary the moduli spaces for the Abelian varieties 
of smaller dimension. Letting p — oo then gives some kind of universal moduli 
space for principally polarized Abelian varieties of finit dimension, and this space 
is then stratifie according to dimension. Similarly, the analogous universal moduli 
space for compact Riemann surfaces would then be stratifie according to genus. (To 
the author’s knowledge, however, this construction has never been carried through 
in detail.) 

In any case, even the integral over the moduli space for a fi ed genus leads to 
some subtleties. The reason is that while the Polyakov action S(q, y) itself is con- 
formally invariant, the measure e~°”)d@dy in (2.7.2) is not. We have seen the 
reason above from a somewhat different perspective in our discussion of quanti- 
zation of the sigma model, where we encountered additional terms in the operator 
expansions. These then led to the nontrivial central charge c of the Virasoro algebra. 
It then turns out that there are two different sources of this conformal anomaly, one 
coming from the field @ and the other from the metric y. The field are mappings 


0Since the partition function represents the amplitude of vacuum —> vacuum transitions, only 
closed surfaces are taken into account. 
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into some euclidean space R@, and we get a contribution to the conformal anomaly 
for each dimension, that is, an overall contribution proportional to d. The conformal 
anomaly coming from y is independent of the target dimension d. It then turns out 
that these two conformal anomalies cancel precisely in dimension d = 26. Mathe- 
matically, this can be explained in terms of the geometry of the Riemann moduli 
space, utilizing earlier work of Mumford [84], or with the help of the semi-infinit 
cohomology of the Virasoro algebra. In conclusion, bosonic string theory lives in a 
26-dimensional space. 

The same scheme applies in superstring theory. Here, the action is given by 
(2.4.149), 


1 - = 
S@ Wy x= 5 [200850 + ty Oa Wa + 2xav oy" Ww dpba 


1. 
+ 5 va" xa y*xp)Vdetydz!dz*, (2.7.3) 


including also the fermionic fiel yw and the gravitino x. The same quantization 
principle is applied, and the resulting dimension needed to cancel the conformal 
anomalies turns out to be d = 10. 

In order to include gravitational fields one has to consider more general targets 
than euclidean space. The appropriate target spaces are Kahler manifolds with van- 
ishing Ricci curvature. The real dimension still has to be 10. In order to make contact 
with dimension 4 of ordinary space-time, one writes such a target as a product 


R*xM (2.7.4) 


where M now is assumed to be compact (and of such a small scale that it is not di- 
rectly observable at the macroscopic level). (This vindicates the old idea of Kaluza 
described in Sect. 1.2.4 above.) The process of making some of the dimensions com- 
pact is called compactificatio in the physics literature. M then has to be a compact 
Kahler manifold with vanishing Ricci curvature, in order to obtain supersymme- 
try, of complex dimension 3, a Calabi—Yau space. In fact, by Yau’s theorem [109], 
every compact Kahler manifold with vanishing firs Chern class cj (M) carries such 
a Ricci fla metric, and this makes the methods of algebraic geometry available for 
the investigation and classificatio of such spaces. 

In order to describe the physical content of string theory, the basic object is the 
string, an open or closed curve. As it moves in space-time, it sweeps out a Riemann 
surface. In contrast to the mathematical framework just described, this Riemann 
surface will have boundaries, even in the case of a closed string when we follow 
it between two different times t; and t2. The boundaries will then correspond to 
the initial position at time ¢, and the fina position at time t2, except when the string 
only comes into existence after time ft; and ceases to exist at time f2. See [62] for the 
systematic treatment of such boundaries in string theory. For an open string, that is, 
for a curve with two endpoints moving in space-time, we obtain further boundaries 
corresponding to the trajectories of these endpoints. More generally, the movement 
of these endpoints may be confine to lower-dimensional objects in space-time that 


2.7 String Theory 207 
carry charges and that can then become objects in their own right, the D-branes*! 
firs introduced by Polchinski [86]. Symmetries between branes then led to a new 
relation between string theory and gauge theory, culminating in a conjecture of Mal- 
dacena [78]. 

In any case, when a string moves in space-time, it sweeps out a surface, and the 
basic Nambu-—Goto action of string theory was the area of that surface. Since the area 
functional is invariant under any reparametrization, it cannot be readily quantized, 
and therefore, the symmetry was reduced by considering the map that embeds the 
surface representing the moving string into space-time and the underlying metric of 
that surface as independent variables of the theory. That led to the Polyakov action 
(2.7.1), that is, the Dirichlet integral or sigma model action (2.4.7). 

According to string theory, all elementary particles are given by vibrations of 
strings. Gauge field arise from vibrations of open strings. Their endpoints repre- 
sent charged particles. For instance, when one is an electron and the other an op- 
positely charged particle, a positron, the massless vibration of the string connecting 
them represents a photon that carries the electrical force between them. Collisions 
between such particles then naturally lead to closed strings. Gravitons, that is, par- 
ticles responsible for the effects of gravity, arise from vibrations of closed strings. 
In superstring theory, both bosons and fermions are oscillations of strings. There 
are only two fundamental constants in string theory, in contrast to the proliferation 
of such constants in the standard model. These are the string tension, that is, the 
energy per unit-length of a string, the latter given in terms of the Planck length, and 
the string coupling constant, the probability for a string to break up into two pieces. 

However, superstring theory is far from being unique, and it cannot determine the 
geometry of the background space-time purely on the basis of physical principles. 
Thus, there is room for further work in superstring theory, as well as for research 
on competing theories like loop quantum gravity (that started with Ashtekar’s refor- 
mulation of Einstein’s theory of general relativity [5]) and the development of new 
ones. 


2! The “D” here stands for Dirichlet, because such types of boundary conditions are called Dirichlet 
boundary conditions in the mathematical literature. We also recall that the basic action functional 
(2.4.7), (2.7.1) is called the Dirichlet integral in the mathematical literature. This terminology was 
in fact introduced by Riemann when he systematically used variational principles in his theory of 
Riemann surfaces, see [91]. Harmonic functions are minimizers of the Dirichlet integral, and in 
this sense, string theory is a quantization of the profound ideas of Riemann. 
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